Genetic privacy


The ability to identify an individual from their anonymous genome sequence, using a clever 
algorithm and data from public databases, threatens the principle of subject confidentiality. 


How private is private? A study published on 17 January reveals
vulnerabilities in the security of public databases that contain
genetic data, the latest in a series of similar revelations. So
far, research funders that host the databases have responded to such 
problems on a case-by-case basis, but it is now clear that the research 
community as a whole must devise a more comprehensive approach. 

In the latest study, led by Yaniv Erlich at the Whitehead Institute for 
Biomedical Research in Cambridge, Massachusetts (M. Gymrek et al. 
Science 339, 321-324; 2013), researchers showed that they could discover
the identity of some men whose genomes had been sequenced
as part of a genomics project (see Nature http://dx.doi.org/10.1038/nature.2013.12237;
2013). Erlich’s team wrote an algorithm that infers
an individual’s pattern (a haplotype) of genetic markers called short 
tandem repeats from the nucleotide sequence of his Y chromosome. 
The team then searched genealogical databases for the names of men 
with corresponding Y-chromosome haplotypes. The team confirmed 
the correct names by cross-referencing the possible last names with 
public records of people of similar ages and locations. 
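
The three steps described above (infer a haplotype, look up matching surnames, then narrow by age and location) can be pictured with a toy sketch. This is not the published algorithm: the record types, field names, exact-match rule and every piece of data below are invented for illustration, and real haplotype inference and genealogical searches are considerably more involved.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenealogyEntry:
    """Hypothetical genealogy-database row: a surname and a Y-STR haplotype."""
    surname: str
    haplotype: tuple  # repeat counts at a fixed, ordered set of markers


@dataclass(frozen=True)
class PublicRecord:
    """Hypothetical public record used for cross-referencing."""
    name: str
    surname: str
    birth_year: int
    state: str


def count_mismatches(a, b):
    """Number of marker positions at which two haplotypes differ."""
    return sum(1 for x, y in zip(a, b) if x != y)


def candidate_surnames(query, entries, max_mismatches=0):
    """Surnames whose recorded haplotype matches the query haplotype."""
    return {
        e.surname
        for e in entries
        if len(e.haplotype) == len(query)
        and count_mismatches(query, e.haplotype) <= max_mismatches
    }


def narrow_candidates(surnames, records, birth_year, state, year_tolerance=1):
    """Keep records with a candidate surname, the same state and a close age."""
    return [
        r
        for r in records
        if r.surname in surnames
        and r.state == state
        and abs(r.birth_year - birth_year) <= year_tolerance
    ]


# Invented example data; none of it refers to real people.
entries = [
    GenealogyEntry("Alder", (13, 24, 14, 11)),
    GenealogyEntry("Birch", (13, 23, 14, 10)),
]
records = [
    PublicRecord("J. Alder", "Alder", 1950, "UT"),
    PublicRecord("K. Alder", "Alder", 1985, "UT"),
    PublicRecord("L. Birch", "Birch", 1950, "UT"),
]
surnames = candidate_surnames((13, 24, 14, 11), entries)
print(narrow_candidates(surnames, records, birth_year=1951, state="UT"))
```

The sketch shows why each extra piece of public metadata matters: the surname lookup alone leaves several candidates, and it is the age and location filter that cuts the list down.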

Using this strategy, the team was able to confirm the identity of known
individuals whose genomes have been sequenced, such as genomics 
entrepreneur Craig Venter, and to discover the identities of anonymous 
research subjects, including five men who participated in both the 
1000 Genomes Project and a study of Utah Mormons initiated by the 
Centre for the Study of Human Polymorphism (CEPH) in Paris. Erlich’s 
team was also able to discern the identity of some of the study subjects’ 
family members, because family pedigrees were collected as part of the 
CEPH study. 

It is important to note that the CEPH cohort is particularly suitable 
for this method of identification, because of the volume of informative 
data that has been collected and published about CEPH participants. 
Their family pedigrees, the places where they lived and their ages at 
the time of the data collection are all public information. Or at least 
they were until the US National Institute of General Medical Sciences, 
part of the National Institutes of Health (NIH), responded to Erlich’s 
study by removing participants’ ages from public view on the Human 
Genetic Cell Repository website that it funds. 

It would probably be more difficult to use Erlich’s method to identify 
participants in studies lacking extensive demographic information. And 
Erlich responded in an exemplary way to his team’s findings by contact- 
ing the NIH and other genetics researchers with his findings before 
publishing them. This sets an important precedent for constructively 
dealing with newly discovered privacy loopholes, and other researchers 
should take note. Erlich’s team is also not publishing the names of the 
anonymous study participants whose identities they uncovered. 

How the genetics community addresses these issues is crucial to how 
large-scale genetic studies will proceed. Although research participants 
are already sometimes told that their data might not remain private (as
the CEPH study participants were), the fact that their identities could
be revealed would seem a remote risk to them, as that has only recently
become possible. It is now imperative that participants fully understand 
that it is unlikely that their identities can be kept hidden if their genetic 
data are revealed. Some participants might welcome this, such as those 
with an interest in genealogy. Others — perhaps those with stigmatized 
diseases, for instance — might not. 

Moving data behind a controlled-access barrier lessens their utility 
to science and to society at large. But researchers need to show the 
public that they are acting as careful stewards of the data entrusted to
them. Erlich argues that the solution is to make sure that participants
understand what they’re signing up for, and to adopt laws that adequately
protect people against the misuse of their genetic information.

Geneticists are brainstorming other proposals for balancing data sharing
with the need to protect the privacy of research subjects. One is to move
more data behind a controlled-access barrier, but
to authorize trusted users to access the data from many studies, rather 
than having to obtain it piecemeal from different studies, as research- 
ers must do today. There are logistical barriers to this — for instance, 
ensuring compatibility across databases. And it is debatable whether 
such restrictions might do more harm than good. 

But if controlled access is not the right solution, it is up to the 
research community, in consultation with the public, to devise a bet- 
ter one. A solution should come sooner, rather than later, because 
this latest revelation of a privacy loophole will be far from the last.


Vigilance needed 


Experiments that make deadly pathogens more 
dangerous demand the utmost scrutiny. 


The year-long voluntary moratorium on research to engineer
strains of the H5N1 avian influenza virus that can transmit
between mammals has already borne fruit. Claims of public-health
benefits have received thorough scrutiny, and the researchers
involved have better explained the biosafety and biosecurity precautions
that they take. The debate has drawn attention to, and exposed gaps in,
the rules that govern ‘dual-use’ research: work that can bring public
benefit but might also be used for harmful purposes. The row has also,
for example, prompted long-overdue national guidelines in the United
States and made funders everywhere more aware of the need to assess
risky research proposals proactively. In short, the moratorium, the
lifting of which is announced this week (see page 460), has seen serious
thought on the complex issues involved.

In the past year, the debate’s focus has somewhat shifted from
bioterrorism concerns (which, being classified, are difficult for outsiders
to evaluate) towards biosafety issues. And it has concentrated attention
more broadly on how best to regulate ‘gain-of-function’ research:
work intended to increase the transmissibility, host range or virulence 
of pathogens. The United States is the main funder of such research, 
and what it decides is key to international thought. The proposed 
framework for assessing H5N1 gain-of-function research, outlined 
by the US National Institutes of Health at an international meeting 
in Bethesda, Maryland, in December, spells out several criteria that 
such research would need to meet before being funded. 

One can quibble with some ambiguities in the wording of those 
proposals, but overall the framework should serve as an important 
checklist. The criteria include sensible questions, such as whether 
safer, alternative approaches exist that could address the same scien- 
tific points. Researchers already accept the need for regulations in areas 
such as animal welfare, and an extra layer of review for gain-of-function 
H5N1 research (which will affect only a few projects) is a small
price to pay for improved public confidence in safety and oversight. 

Flu researchers have been generous with their time over the past 
year. They have engaged in public debates and expressed their often- 
conflicting views in commentaries in scientific journals. The polariza- 
tion of views between proponents and opponents of such research has, 
however, too often resulted in reiterations of entrenched viewpoints, 
rather than substantive discussions. Whether justified or not, there 
remains a perception among many critics that the debate has taken 
place largely behind closed doors, and has been dominated by flu 
scientists and research funders who have vested interests in the outcome. 


As several critics point out, the assessments of the relative risks 
and benefits of such research remain restricted to largely qualita- 
tive arguments. The formal, quantitative risk assessment common 
in the nuclear power and other industries could have helped to nail 
down and quantify risks, and would have informed the debate better. 
One year on, an irreproachable, independent risk-benefit analysis of
such research, perhaps convened by a body such as the World Health
Organization (WHO), is still lacking.


When it comes to mitigating risks, it is gratifying that the WHO
guidelines on mammalian-transmissible H5N1 research, released last July,
go beyond simply discussing the required level of biocontainment
facility. They also recommend that labs doing such work should conform
to international
risk-management standards, thus encouraging a culture of safety in 
all procedures and practices. 

The guidelines go on to state [original emphasis]: “Given the poten- 
tial of these newly developed laboratory-modified H5N1 strains to 
start a pandemic, it is important that facilities that are NOT able to 
identify and appropriately control the risks associated with these 
agents REFRAIN from working with them.” Those are sensible words,
but unfortunately lack any means of enforcement. 

The lifting of the moratorium by researchers must not be seen as 
closure of the debate. The potential risks of the work demand excep- 
tional precautions in any future research. It is clear that the immediate 
practical applications of gain-of-function flu research remain largely 
hypothetical, and that its true value lies in long-term fundamental 
research to improve understanding of the transmissibility and patho- 
genicity of the virus. That makes it even more incumbent on researchers 
and authorities to exercise the greatest responsibility and prudence.


Science stakes 


With the Royal Institution in trouble, Britain’s 
crowded public-science scene must evolve. 


Since 1799, the Royal Institution of Great Britain has occupied a
grand building in London's Mayfair, surrounded today by luxury
shops and private art galleries. For many years, the building was a
central part of British science. Michael Faraday dazzled crowds there
in the nineteenth century with pyrotechnic displays of chemistry.

In many respects, its address, 21 Albemarle Street, is the Royal
Institution (RI), hence the consternation in the United Kingdom
and abroad when The Times newspaper last week reported that the RI
building was up for sale. The news was no surprise. The RI has been on
the financial ropes for years, lumbered with the costs of a misguided
£22-million (US$35-million) refurbishment.

Richard Sykes, the RI’s current chairman, said last week that the
charity was likely to be restructured. But the RI, whose property
includes a remarkable collection of historic scientific equipment and
documents, insists that it will continue its mission to educate and
inform the public about science and will not fold.

In many ways, the RI is a victim of the trend it pioneered. When the
charity started out in 1799, science itself was a novelty. What would
now be deemed ‘science outreach’ was even more so. Albemarle Street
became London's first one-way street, to deal with the crowds that
headed there. Now, nearly every university encourages its academics
to push their research to the public, and science communication itself
has become a career.

Perhaps more importantly, people who wish to be informed about
a topic no longer need to sit in an uncomfortable seat and listen to a
lecture by an éminence grise. While the RI resolutely championed this
formal mode of engagement, the rest of the world has moved on. The 
vectors of knowledge are the Internet and mass media, not refined 
public meeting rooms. In its defence, the RI has made some attempt 
to modernize, but it is still known to most people as the place with the 
famous old (and very steep) lecture theatre. 

Happily, there remains a market for science events. People flock 
to informal venues and to the type of flamboyant entertainment pio- 
neered by Faraday. Cafés Scientifiques have taken off in many coun- 
tries, and thousands of people attend science festivals in the United 
Kingdom, elsewhere in Europe and in the United States. 

With the future of the RI in severe doubt, those who care about 
science communication in Britain should take this opportunity to dis- 
cuss publicly how the landscape should change. And if such efforts fail 
to be self-sustaining, the RI’s trustees should consider whom to favour 
with the charity’s collection of historic equipment and other resources. 

Here is Nature’s brief guide to the runners and riders, should the RI 
withdraw from the race. The Royal Society does not have the corporate 
stomach or skills to take on a substantial increase in science commu- 
nication and engagement activities. The British Science Association 
has appointed an ambitious new chief executive, and faces a tough 
challenge just to develop its annual public meeting into an event with 
national impact. The Wellcome Trust is strong in the crowded science- 
outreach field, at least in London, but is focused on biomedicine. 

Alongside these, and with a lively pack of mass media, bloggers 
and tweeters snapping at its heels, the RI seems likely to emerge 
redundant, whatever happens to its lovely buildings. The institution 
best positioned to inherit its legacy is the Science Museum, which 
has invested well in showmanship and online 
facilities. It has yet to make its mark as a forum of
national discussion, but has ambitions to do so.
And its headquarters are always crowded with
children and adults, and fun to visit, too.


Ensuring health in
universal health coverage


Health systems must transcend clinical medicine and emphasize
public-health approaches aimed at the drivers of disease, argues
James D. Shelton.


Our planet's staggering ocean of death and disability, from
backache to cancer, was painstakingly detailed in the Global
Burden of Disease 2010 analysis, filling nearly an entire issue of
The Lancet in December. One approach to this overwhelming disease
burden is universal health coverage (UHC), which has been broadly
defined as universal access to needed health services without financial
hardship in paying for them. Indeed, Lancet editor Richard Horton
declared the findings to be so far-reaching that “we should use them as
a platform to advocate ever more vigorously for the growing consensus
that universal health coverage could be the third great global health
transition”. But faced with almost limitless need and finite resources,
what form of UHC is best? Demand will only rise owing to ageing,
growing populations and ever-more-sophisticated and expensive
technologies. UHC must include all available options
to improve health, but much of its language
(coverage, access, payment) conveys clinical
medical care, especially through insurance.

For UHC to have the impact it promises, we
must approach it with a different mindset: one that
is broader, is driven by impacts and goes beyond
the conventional pay-per-procedure approach.

Look no farther than the United States to see
huge sums spent on expensive medical services,
yet little improvement in the key indicators of life
expectancy and infant mortality. Clinical services
clearly have benefits, but their cost-effectiveness
and impact on population-level health are far
from clear. That is partly because therapies are not
always effective and some even harm. Also, many
systems emphasize payment for procedures and
not the overall health of the population. But most
important is that medicine's curative arsenal tends
to arrive too late to address the drivers of disease.

The Global Burden of Disease analysis is compelling. Of the top
ten risk factors, only high blood pressure (which accounts for 7.0%
of the overall burden) and high blood glucose (3.6%) can be readily
addressed by the clinical, curative approaches that dominate medical
services. The others almost entirely elude clinical intervention. And
even blood pressure and glucose level are highly affected by lifestyle
factors that the analysis couldn't completely tease out. Moreover, 29
of the remaining 33 risk factors listed, which include many dietary
factors, are equally resistant to clinical tools. For developing countries,
the balance shifts even more towards non-clinical causes, as conditions
such as suboptimal breastfeeding and sanitation rise in importance.

Developing countries face particular challenges. They tend to have
weak public-sector services, poor private-sector organization and large
expenditures. Rapid economic advancement can offer a way to shift
towards more-efficient, equitable systems and avoid financial hardship,
but the pitfall is preoccupation with clinical services. For example,
Ghana’s health-insurance
programme follows the familiar pattern: reimbursement for mainly 
curative procedures (especially provision of drugs). Family planning 
and immunization are not covered, despite their substantial health ben- 
efits and major downstream cost savings, under the assumption that 
they are addressed by overburdened government clinics. 

UHC will best serve developing countries if five approaches are used. 

First, promoting healthy behaviours, such as exclusive breastfeeding 
and hand washing. Building overall health literacy is a good foundation. 

Second, introducing structural and regulatory approaches such as 
tobacco taxation, clean-air requirements and speed bumps. 

Third, prioritizing the clinical services that have the most impact: 
immunization, family planning and antena- 
tal care. And programmes should be oriented 
towards the overall health of the population, 
rather than just paying for procedures. Health 
reform in the United States in the past few years 
has moved in these directions by making high- 
priority services free, and promoting organiza- 
tions that take overall responsibility for their 
clients’ health. 

Fourth, deploying community-based services 
for high-impact health interventions. Ethiopia's 
Health Extension Worker cadre, which effectively 
provides key services, including sanitation and 
family planning, is a good example. 

Finally, directing research and development 
less towards sophisticated technologies and more 
towards the strongest drivers such as nutrition 
and behaviour and towards low-cost, high-impact 
innovations, such as less-polluting cooking stoves. 

Is it feasible? Yes. The economic extremes of 
Niger and New York City have shown this to be the case. Niger reduced 
child mortality by almost half between 2000 and 2009, emphasizing 
high-impact community-based and behavioural interventions. Almost 
half of the decline is attributed to improved nutrition and insecticide- 
treated bednets. New York City has seen life expectancy rise by 3.8 years 
and infant mortality fall by 23% in the past decade by emphasizing non- 
clinical approaches. Smoking has decreased markedly through increased 
taxes, decreased tobacco-product availability and hard-hitting public- 
health education programmes. Key community prevention-outreach 
programmes include HIV prevention and home visits to families with 
newborns. For true universal impact, other countries must follow suit.


James D. Shelton is science adviser, Bureau for Global Health, 
US Agency for International Development, Washington DC, USA. 
The views expressed are not necessarily those of the agency. 
e-mail: jshelton@usaid.gov 
RESEARCH HIGHLIGHTS

Selections from the scientific literature


Black carbon a
warming culprit


The soot that is emitted into the atmosphere from activities such as
burning of diesel and biomass is making a bigger contribution to global
warming than previously thought. This finding puts ‘black carbon’
second only to carbon dioxide in terms of its warming impact.

Tami Bond at the University of Illinois at Urbana-Champaign and her
colleagues analysed data from a ground-based network of aerosol
sensors, run by NASA, as well as satellite observations and
global-emissions inventories. The authors found that the amount of
warming from black carbon (which absorbs solar radiation and heats the
atmosphere, as well as melting snow and ice) is roughly double most
earlier estimates. Lowering black-carbon emissions could be a quick
way to cool the climate, but the overall effect of atmospheric
aerosols on climate is still uncertain, the authors caution.
J. Geophys. Res. http://dx.doi.org/10.1002/jgrd.50171 (2013)
For more on this research, see go.nature.com/ztocgf


Ceramics make
water unwelcome


Water-repellent materials have a wide range of applications, but many
hydrophobic coatings cannot withstand harsh conditions. Now,
researchers at the Massachusetts Institute of Technology, Cambridge,
report the production, from rare-earth oxides, of ceramic materials
that remain hydrophobic even after exposure to harsh environments.

Kripa Varanasi and his colleagues synthesized ceramics from the
lanthanide oxide series, including cerium oxide. In analysing the
surface chemistry of the ceramics, the researchers found that the
oxides’ electronic structure inhibits hydrogen bonding with water
molecules. They show that water droplets bounce off a cerium oxide
surface (pictured), leaving the surface dry. Moreover, the ceramics
maintain their hydrophobic nature even after abrasion and exposure to
high temperatures.
Nature Mater. http://dx.doi.org/10.1038/nmat3545 (2013)


Quail pick nests that best hide eggs


Individuals of a bird species may nest in different areas according to
which best camouflages the patterns on their eggs, report George Lovell
at Abertay University in Dundee, UK, and his colleagues.

Japanese quail (Coturnix japonica) lay speckled eggs that vary greatly
in appearance between mothers. The authors offered 15 females four
different colours of sand in which to lay their eggs. Using
photographic analysis, the team found that females with more highly
spotted eggs typically chose darker sand that matched the colour of the
spots. By contrast, quail that laid lightly spotted eggs selected paler
sand that matched the background colour of the eggs.

The authors suggest that individual female quail select egg-laying
strategies best suited to hiding their eggs from detection by predators.
Curr. Biol. http://dx.doi.org/10.1016/j.cub.2012.12.031 (2013)


Gene linked to
Alzheimer’s


Two studies have identified a rare genetic mutation that boosts the
risk of Alzheimer’s disease.

Kari Stefansson of deCODE Genetics in Reykjavik and his group analysed
the genomes of 2,261 Icelanders and uncovered a mutation in the TREM2
gene that increased the risk of Alzheimer’s disease roughly threefold.
A separate research team, led by John Hardy at University College
London, discovered the same mutation, along with several others in the
same gene, when analysing the genomes of more than 1,000 people with
the disease.

The protein encoded by TREM2 is expressed in brain immune cells called
microglia, and regulates the process by which these cells engulf cell
debris. The findings suggest that, in the brains of people with the
neurodegenerative disorder, immune cells are unable to clear away the
plaques that are hallmarks of the disease.
N. Engl. J. Med. 368, 107-116; 117-127 (2013)


Leprosy bacteria 
reprogram cells 


The bacterium that causes 
leprosy spreads through the 
body by reprogramming 
infected cells to become stem- 
cell-like cells that can form 
other cell types. 

Leprosy is caused by the 
bacterium Mycobacterium 
leprae, which infects Schwann 
cells that insulate nerve cells. 
When Anura Rambukkana at 
the University of Edinburgh, 
UK, and his colleagues infected 
mouse Schwann cells with the 
bacterium, they found that the 
cells expressed genes typical of 
an earlier developmental stage. 
After they were injected into 
mouse muscle, these stem-cell- 
like cells turned into muscle 
cells, and the bacteria spread 
to neighbouring muscle tissue. 
The stem-cell-like cells also 
transferred bacteria to immune 
cells called macrophages, 
which spread the infection 
further. 

Understanding how 
M. leprae reprograms cells 
could inform treatment 
strategies, as well as provide 
tools for regenerative medicine, 
the authors say. 

Cell 152, 51-67 (2013) 
For a longer story on this research, 
see go.nature.com/fbkksy 


Compressed 
picture-taking 


Digital cameras take images
as arrays of pixels that are
compressed by algorithms
into a smaller file. But a system
designed by John Hunt and his 
colleagues at Duke University, 
Durham, North Carolina, can 
generate a compressed picture 
without the need for a post- 
processing stage. 


The device uses an approach 
that samples image data at 
random, but still includes 
enough information to 
generate a good-quality image. 
The authors used an aperture 
made of a strip of metamaterial
— an artificial structure that 
interacts with light in ways not 
found in nature — that guides 
microwaves to a single-pixel 
sensor. The patterns in the 
material are ‘transparent’ to 
certain wavelengths, so the 
signal ‘leaks’ information 
as it moves down the strip, 
allowing for the random data 
sampling. 
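
The idea of recovering a scene from a handful of randomly weighted readings at a single sensor can be illustrated with a toy sketch. It rests on assumptions that are not in the study summary: random Gaussian weights stand in for the metamaterial's wavelength-dependent patterns, the scene is one-dimensional and contains a single bright object, and recovery is one step of a matching-pursuit search. All function names are hypothetical.

```python
import random


def make_masks(num_measurements, num_pixels, seed=0):
    """One row of random weights per measurement (a stand-in for the aperture)."""
    rng = random.Random(seed)
    return [
        [rng.gauss(0.0, 1.0) for _ in range(num_pixels)]
        for _ in range(num_measurements)
    ]


def measure(masks, scene):
    """Each reading is the weighted sum of the scene seen by a single sensor."""
    return [sum(w * s for w, s in zip(row, scene)) for row in masks]


def locate_single_object(masks, readings):
    """Find the one pixel whose weight pattern best explains the readings.

    Returns (pixel index, estimated brightness). This is a single step of
    matching pursuit and assumes the scene holds exactly one object.
    """
    best_pixel, best_score, best_amplitude = 0, -1.0, 0.0
    for pixel in range(len(masks[0])):
        column = [row[pixel] for row in masks]
        dot = sum(c * r for c, r in zip(column, readings))
        norm_sq = sum(c * c for c in column)
        score = abs(dot) / norm_sq ** 0.5  # normalized correlation
        if score > best_score:
            best_pixel, best_score = pixel, score
            best_amplitude = dot / norm_sq  # least-squares brightness
    return best_pixel, best_amplitude


# A 32-pixel line with one object, observed with only 8 readings.
scene = [0.0] * 32
scene[19] = 2.5
masks = make_masks(num_measurements=8, num_pixels=32, seed=1)
readings = measure(masks, scene)
print(locate_single_object(masks, readings))
```

Eight readings suffice here only because the scene is assumed to be so sparse; a scene with more objects would need more readings and an iterative solver.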

Using this system, the 
researchers created a video of 
a moving object's path in one 
dimension. The technology 
could one day be used in 
airport scanners, radar 
systems and infrared imaging, 
the authors say. 

Science 339, 310-313 (2013) 


Irrigation brings 
more rain 


Agricultural irrigation in the 
Central Valley of California 
doubles the amount of water 
vapour in the atmosphere, 
leading to increased rainfall 
throughout the southwestern 
United States. 

James Famiglietti and Min- 
Hui Lo of the University of 
California, Irvine, analysed the 
impact of irrigation on climate 
using a global climate model. 
They found that irrigation adds 
so much water to the region's 
hydrological cycle through 
evaporation and transpiration 
(whereby plants release 
water into the atmosphere) 
that summer precipitation is 
increased by 15%. As a result,
summer run-off is raised by 
56% across the region and by 
28% in the Colorado River 
Basin, which is a source of fresh 
water for southern California. 

A better understanding of 
the impact of irrigation on 
regional climate and water 
availability can improve 
resource management, the 
authors say. 

Geophys. Res. Lett. http://dx.doi.org/10.1002/grl.50108 (2013)
CHOICE


Cancer drugs from algae 


HIGHLY READ
on www.pnas.org 
in December 


Algae could be used to make complex, 
targeted cancer drugs, thanks to their 
photosynthetic organelles, the chloroplasts. 


Immunotoxins are drugs that combine
a toxin with an antibody; the antibody targets the toxin to
specific cells, such as cancer cells. However, the drugs have 
proved difficult and expensive to produce. Stephen Mayfield 
and his colleagues at the University of California, San Diego, 
reasoned that the green alga Chlamydomonas reinhardtii 
could be used to produce immunotoxins. The chloroplasts 
of this alga contain the machinery to properly fold complex 
proteins — such as antibodies — and can tolerate some toxins. 

The team used the alga to produce an antibody to CD22
(a protein found on a type of immune cell called a B cell)
coupled to a bacterial toxin. The immunotoxin killed B cells
in culture, and inhibited the growth of human B-cell tumours
that had been implanted in mice.


Proc. Natl Acad. Sci. USA 110, E15-E22 (2013) 


ECOLOGY 


Arctic rain brings 
animal pain 


Winter rain, an unusual event 
in the high north, drives animal 
numbers on a Norwegian 
Arctic island into decline, 
showing that extreme climate 
events can affect an entire 
community of vertebrates. 

Brage Hansen at the 
Norwegian University of 
Science and Technology, 
Trondheim, and his team found 
that populations of Svalbard 
reindeer (Rangifer tarandus 
platyrhynchus), rock ptarmigan 
(Lagopus muta hyperborea) and 
sibling voles (Microtus levis) on 
the island of Spitsbergen crash 
during especially rainy winters,
when the animals’ food is
coated in layers of ice. The
extreme weather synchronizes 
fluctuations in the population 
numbers of these herbivores. 
This, in turn, causes numbers 
of the Arctic fox (Vulpes 
lagopus; pictured) — which 
preys on ptarmigans and voles, 
and feeds on reindeer carrion 
— to oscillate, but with a one- 
year lag. 

The overwintering vertebrate 
community could represent 
a bellwether for the influence 
of global warming on Arctic 
ecosystems, if climate change 
raises the prevalence of warmer 
and wetter winters. 
Science 339, 313-315 (2013) 


SEVEN DAYS


US climate agenda 


Barack Obama has vowed to 
put climate change onto the 
nation’s political agenda in his 
second term as president of 
the United States. He delivered 
his second inaugural address 
on 21 January before a crowd 
of thousands on the National 
Mall in Washington DC, 
stating that Americans “will 
respond to the threat of 
climate change, knowing that 
the failure to do so would 
betray our children and future 
generations”. Environmental 
groups criticized Obama for 
not implementing climate 
policies in his first term. 


Yes to flu vaccine
The US Food and Drug Administration on 16 January approved Flublok,
the first seasonal flu vaccine made from recombinant proteins, which
is produced in insect cells grown in culture. The standard
vaccine-production method uses inactivated or weakened viruses grown
in chicken eggs. Limited supplies of Flublok will be available this
winter, providing some relief for vaccine shortages in the United
States, which is suffering from a severe flu season. Development of
Flublok by Protein Sciences in Meriden, Connecticut, was supported by
the US Biomedical Advanced Research and Development Authority. See
go.nature.com/2cpzzy for more.


Wind-energy capacity installed by the United States last year (an
annual record), with 5.5 gigawatts (GW) in December alone. US wind
capacity is now 60 GW, 6% of the country’s electricity-generating total.


Insecticides linked to poor bee health
The European Food Safety Authority (EFSA) in Parma, Italy, has warned
that three widely used neonicotinoid insecticides could pose a risk
to honeybee health. It recommends their use be restricted to crops
that are not attractive to the insects, amid concerns about declining
bee populations. Syngenta, the Swiss company that manufactures one of
the chemicals investigated, described the EFSA’s work as “hurried and
inadequate”, and suggested that the agency had succumbed to political
pressure to reach quick conclusions. The EFSA had not responded to
this allegation when Nature went to press. See go.nature.com/v3zdne
for more.


Gun research
US President Barack Obama has ordered the US Centers for Disease
Control and Prevention (CDC) in Atlanta, Georgia, to conduct or fund
research into the causes of gun violence. Obama announced the move on
16 January after a shooting at an elementary school in Newtown,
Connecticut, in December killed 26 adults and children. The work ends
a 17-year hiatus in the CDC’s gun-research activities. See
go.nature.com/xn75ly for more.


Mercury capped
A legally binding treaty to cap environmental emissions of the toxic
metal mercury was agreed by 140 nations on 19 January. The treaty has
taken four years to negotiate and will be open for signatures at a
meeting in Minamata, Japan, in October. It is expected to come into
force within 3-5 years and requires countries with artisanal and
small-scale gold-mining operations to draw up national plans to reduce
and, if possible, eliminate the use of mercury. See
go.nature.com/vqch6y for more.


RESEARCH

Flu work resumes
Flu scientists have partially lifted a year-long moratorium on
research aimed at re-engineering highly pathogenic H5N1 avian
influenza viruses to transmit efficiently between mammals. In a letter
published on 23 January in Nature and Science, 40 flu scientists said
that authorities have now reviewed the biosafety and biosecurity
conditions under which the research can be done safely, and scientists
with national approval should resume their work. See pages 451 and 460
for more.


India Moon mission 
India’s second mission to the 
Moon, scheduled for 2015, 
will be a solo affair and not 
the joint venture with Russia 
that was originally proposed, 
a top scientist at the Indian 
space agency told Nature on 
22 January. Russia had agreed 
in 2007 to provide a lander 
and rover for the mission, 

but withdrew after a Russian 
mission to Mars’s moon 
Phobos failed in November 
2011, leading to a review of 
its lander technology. See
go.nature.com/izhskf for 
more. 


Disaster relief 

The US House of 
Representatives on 15 January 
approved some US$50 billion 
in emergency-relief funding 
to help the east coast recover 
from the destruction caused 
by Hurricane Sandy last year. 
The package includes roughly 
$194.5 million for the National 
Oceanic and Atmospheric 
Administration to improve 
weather prediction and 
monitoring, and $15 million 
for NASA to repair damaged 
facilities. The Senate is 
expected to consider the bill in 
the week of 21 January. 


PEOPLE 


Space chief quits 
The president of the Canadian 
Space Agency, former 
astronaut Steve MacLean, 
announced that he is to leave 
the agency on 1 February 
after five years in the job. 

He is moving on to lead a 
research team in quantum 
physics at a new venture to 

be created by Mike Lazaridis, 
former co-chief executive 

and co-founder of the 
Research in Motion company, 
which makes BlackBerry 
smartphones. See go.nature. 
com/bxeeq/7 for more. 


TREND WATCH 


The Carter Center in Atlanta, 
Georgia, says that it is in the 


final stages of wiping out Guinea 
worm disease (dracunculiasis), 


which is carried by larvae in 


contaminated water. The disease 
was once endemic in 21 countries 
in Africa and Asia, but just 542 


cases were reported in 2012, 


most of them in South Sudan (see 
chart). There are no medicines 

to treat dracunculiasis, but with 
health education and by teaching 
people to filter drinking water, 
the centre aims to eradicate it in 


the next few years. 


USGS departure 


Marcia McNutt (pictured), 
director of the US Geological 
Survey (USGS), has 
announced that she will resign 
on 15 February. McNutt, 

a geophysicist, will stay to 
oversee the launch of the 
Landsat 8 Earth-observation 
satellite scheduled for 

11 February. Her resignation 
is one of several leadership 
changes at national agencies as 
Barack Obama heads into his 
second term as US president. 
On 16 January, Ken Salazar 
announced his resignation as 
head of the Department of the 
Interior, the parent agency of 
the USGS. 


FACILITIES 
Fusion home 


The European Union has 
awarded a €300-million 
(US$399-million) contract 
to build the housing for 
ITER, the international 
experimental fusion reactor, 
to a consortium of French 
and Spanish construction 


companies. Completion of the 
building is slated for mid- 
2018, and ITER is scheduled to 
conduct its first experiments 
in November 2020. The total 
cost of the project, located 

at Cadarache in the south of 
France, is estimated at roughly 
€15 billion. See go.nature.com/ 
uvjax7 for more. 


Reef lab saved 


A beleaguered underwater 
research laboratory in 

Florida was thrown a lifeline 
on 15 January after Florida 
International University in 
Miami agreed to take over 

the running of the facility in 
2013. The Aquarius Reef Base, 
owned by the US National 
Oceanic and Atmospheric 
Administration (NOAA), was 
set to close because of federal 
budget cuts. The university 
has received US$600,000 from 
NOAA towards maintaining 
the lab this year, but the base's 
long-term future remains 
uncertain. See go.nature. 
com/4wup3y for more. 


Institution for sale? 


The luxurious west London
headquarters of the esteemed
Royal Institution, where
electrical pioneer Michael
Faraday made many of his
discoveries and established
the prestigious Christmas
lectures, could be put on sale
for upwards of £60 million
(US$95 million). In a
statement on 17 January,
chairman Richard Sykes
said that the 200-year-old
institution would consider
subletting or selling the
building. The move follows
years of financial turmoil
and internal strife at the
institution, which in 2005-08
spent £22 million on a
controversial refurbishment
that left it short of funds. See
go.nature.com/xoeldp and
page 452 for more.


Slashed budget

A troubled cancer-research
agency in Texas faces a
massive budget cut announced
in the state's preliminary
spending plans on 14 January.
The Cancer Prevention and
Research Institute of Texas in
Austin is set to receive only
US$5 million per year for 2014
and 2015, down from nearly
$300 million per year. The cut
reflects criticisms last year that
some grants awarded had not
gone through the institute's
peer-review system. See
go.nature.com/sjyomo
for more.


GUINEA WORM ALMOST WIPED OUT
Dracunculiasis would be the second human disease
to be eradicated, after smallpox.


29 JANUARY
NASA is scheduled
to launch the first
in a new generation
of communications
satellites to transfer data
from the International
Space Station and agency
satellites, craft and
balloons. Its most recent
data-relay satellite was
launched a decade ago.
tdrs.gsfc.nasa.gov


30 JAN-2 FEB
ScienceOnline — a
conference for writers
and researchers to
discuss science and
science communication
on the Internet — takes
place in Raleigh, North
Carolina.
scienceonline.com

Isotopes in this ice core reveal how glaciers responded to climate change more than 100,000 years ago.


CLIMATOLOGY


Greenland defied 
ancient warming 


But Antarctic glaciers may be more vulnerable than thought. 


BY QUIRIN SCHIERMEIER 


Over a few exceptionally warm days last
July, Greenland’s frozen surface turned
into a colossal puddle. Even the coldest
parts of the world’s largest island saw ice thaw 
and rain fall, fuelling concerns over the future 
of glaciers that hold enough water to raise 
global sea levels by around 7 metres. 

A record of the past written in an ancient ice 
core now reveals that Greenland’s ice sheet does
not melt as easily as some fear. But the message
is not entirely reassuring: it also implies
that Antarctica has much greater potential to 
raise sea levels than previously thought. 

The 2,540-metre-long core comes from a site 
in northwestern Greenland that hosts some of 
the island's oldest ice, dating back to the Eemian 


interglacial: the Earth’s most recent warm 
period before temperate conditions returned 
once more some 12,000 years ago. Lasting 
from about 130,000 to 115,000 years ago, the 
Eemian warming was triggered by increases in 
summer sunshine, resulting from variations in 
Earth’s orbit and axis of rotation. But it may be 
a good model of the man-made climate change 
expected over the coming centuries. “We are in 
a similar climate regime as the world was in the
early Eemian,” says Eric Steig, who studies ice 
cores at the University of Washington in Seattle. 

The North Greenland Eemian Ice Drilling 
(NEEM) project, a 14-nation research team, 
has spent the past four years drilling and ana- 
lysing the core, and its results are published in 
Nature this week. Extracting a record of the
Eemian was a challenge: the core’s lowest layers 


had been deformed and folded by the constant 
movement of the ice sheet. “Of course we had
hoped for a purer record,” says Dorthe Dahl-Jensen,
a palaeoclimatologist at the University
of Copenhagen who led the NEEM project. “But
it is a fantastic record even so, and it does have
all the information we needed to reliably reconstruct
the Eemian climate and ice-sheet history.”

She and her colleagues painstakingly com- 
pared and synchronized the disturbed parts 
of their core, layer by layer, with other well- 
dated ice cores from Greenland and Antarctica 
to tease out the story. “I am very sure that we
have got the picture right,” says Dahl-Jensen. 

The results confirm the warmth of the 
Eemian climate: ratios of oxygen and nitrogen 
isotopes in the core show that some 6,000 years 
after the onset of the Eemian, local tempera- 
tures reached about 8°C above the present-day 
annual average of roughly −25°C.

But the ice sheet at the NEEM site did not 
get much thinner than its present 2.5 kilome- 
tres, according to the air content in the core 
that the team correlated with elevation. Before 
the Eemian, the Greenland ice sheet was some 
200 metres thicker than it is today, and after six 
millennia of strong warming the ice sheet lost 
just over one-tenth of its thickness, dropping 
to 130 metres below its present height. It then 
remained steady until the end of the Eemian 
(see ‘Warm spell’). “It's illuminating to see just
how strongly Greenland had warmed at the 
time,” says Steig. “That the warming seem- 
ingly caused only relatively modest melting is 
of course also remarkable.” 
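The thickness figures quoted in this paragraph can be checked with a little arithmetic. The sketch below is only an illustration, not the NEEM team's calculation: it assumes that changes in surface height at the site translate one-to-one into changes in ice thickness, and the function and variable names are hypothetical.

```python
# Figures quoted in the article, in metres. Names are illustrative only.
PRESENT_THICKNESS_M = 2_500       # "its present 2.5 kilometres"
PRE_EEMIAN_EXCESS_M = 200         # thicker than today before the Eemian
EEMIAN_MINIMUM_DEFICIT_M = 130    # below present height at the minimum


def eemian_thinning(present_m, excess_m, deficit_m):
    """Return (metres lost, fraction of the pre-Eemian thickness lost).

    Assumption: surface-height change equals thickness change.
    """
    start_m = present_m + excess_m      # thickness before the warming
    lowest_m = present_m - deficit_m    # thickness at the Eemian minimum
    lost_m = start_m - lowest_m
    return lost_m, lost_m / start_m


lost, fraction = eemian_thinning(
    PRESENT_THICKNESS_M, PRE_EEMIAN_EXCESS_M, EEMIAN_MINIMUM_DEFICIT_M
)
print(f"{lost} m lost, {fraction:.1%} of the starting thickness")
```

Under these assumptions the loss comes to 330 metres, roughly 12% of the starting thickness, which is consistent with the "just over one-tenth" quoted above.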

There is a small possibility that the recon- 
struction is incomplete because it reflects con- 
ditions at only one site, which may provoke 
debate about the results, says Steig. Until the 
past few years, most researchers had thought 
that Greenland contributed at least half of the 
6-8 metres of Eemian sea-level rise that has 
been deduced from records of ancient corals
and other markers. Yet the NEEM core
implies that Greenland’s ice sheet lost at most 
one-quarter of its volume, and contributed no 
more than 2 metres of sea-level rise. “The good 
news is that Greenland is not as sensitive to climate
warming as we thought,” says Dahl-Jensen.


“The bad news is that if Greenland’s ice sheet did
not disappear during the Eemian, Antarctica must
have been responsible for a significant part
of the sea-level rise,” she adds. These two ice
sheets, the world’s biggest, have been stable for 
most of the current interglacial period. But since 
temperatures began to soar a couple of decades 
ago, Greenland and Antarctica have been shed- 
ding ice fast. Between 1992 and 2011, they lost 
around 2,700 billion and 1,350 billion tonnes of 
ice, respectively — enough to raise sea levels by 
about 0.6 millimetres per year. Scientists think
that by 2100, the global sea level may have risen 
by 0.5-1.2 metres above current levels. 
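As a rough check on the figure of about 0.6 millimetres per year, the ice-loss totals quoted above can be converted to an equivalent sea-level rise. This is a back-of-the-envelope sketch, not the method used in the cited study: the ocean area of about 3.61 × 10^14 square metres is an approximate value assumed here, the 19-year span is taken simply as 2011 minus 1992, and all names are hypothetical.

```python
OCEAN_AREA_M2 = 3.61e14         # approximate global ocean area (assumed value)
WATER_DENSITY_KG_M3 = 1000.0    # density of fresh meltwater


def ice_loss_to_sea_level_mm(mass_gt):
    """Convert an ice mass loss in gigatonnes (billion tonnes) into the
    equivalent global mean sea-level rise in millimetres."""
    mass_kg = mass_gt * 1e12                     # 1 Gt = 1e12 kg
    volume_m3 = mass_kg / WATER_DENSITY_KG_M3    # meltwater volume
    return volume_m3 / OCEAN_AREA_M2 * 1000.0    # metres to millimetres


greenland_gt = 2700     # losses quoted for 1992-2011
antarctica_gt = 1350
years = 2011 - 1992

total_mm = ice_loss_to_sea_level_mm(greenland_gt + antarctica_gt)
rate_mm_per_year = total_mm / years
print(f"{total_mm:.1f} mm in total, {rate_mm_per_year:.2f} mm per year")
```

With these assumptions the combined loss works out to a little over 11 millimetres in total, or close to 0.6 millimetres per year, in line with the figure in the text.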
Although ice loss is currently greatest in 
Greenland, that could change: some parts of 
Antarctica are warming almost twice as fast as 
previously believed, and glaciers in western
Antarctica have retreated at a worrying rate in 
the past few decades. Furthermore, Eemian
sea-level rise seems to have proceeded in drastic
jumps, rather than gradually, suggesting
that the ongoing sea-level rise could accelerate. 
That may not be a menace to civilization 
yet. But, says Dahl-Jensen, if Antarctica’s mas- 
sive ice sheets do disintegrate — as the NEEM 
core suggests they did before — we could face 
an extremely rapid sea-level rise around the 
world. NEEM’s message is that the Eemian is 
distant only in years, not in consequence.
WARM SPELL

The Eemian interglacial period (130,000-115,000 
years ago) began with a burst of climate warming — 
but this caused only a modest shrinkage of the ice 
sheet that covered Greenland at the time. 
Work resumes on
lethal flu strains 


Study of lab-made viruses a ‘public-health responsibility’. 


BY DECLAN BUTLER 


An international group of scientists this
week ended a year-long moratorium
on controversial work to engineer
potentially deadly strains of the H5N1 avian 
flu virus in the lab. 

Researchers agreed to temporarily halt 
the work in January 2012, after a fierce row 
erupted over whether it was safe to publish 
two papers reporting that the introduction of a
handful of mutations enabled the H5N1 virus 
to spread efficiently between ferrets, a model 
of flu in mammals (see Nature http://doi.org/ 
fxv55r; 2012). Both papers were eventually 
published, one in Nature and one in Science.

Now, in a letter simultaneously published
on 23 January by Nature and Science, the
40 scientists involved say that the morato- 
rium has served its purpose: allowing time 
for authorities to review the conditions under 
which the research could be safely conducted 
and for scientists to explain the public-health 
benefits of the work. Scientists who now have 
official approval in their countries to conduct 
such research “have a public-health responsibility
to resume this important work”, the
letter states, “because the risk exists in nature 
that an H5N1 virus capable of transmission 
in mammals may emerge”. 

The move follows a large international 
workshop convened on 17-18 December 
by the US National Institutes of Health in 
Bethesda, Maryland, to discuss ‘gain-of-function’
research — that intended to increase the
transmissibility, host range or virulence — in 
H5N1 viruses, and the development of US 
rules for stricter oversight of research in this 
area. The proposed rules require an assess- 
ment of, for example, whether the scientific 
aims of such studies could be addressed 
using alternative, less-risky approaches, and 
whether biosafety and biosecurity risks can 
be adequately mitigated. They are expected 
to enter into force soon, allowing scientists 
working in the United States or on US-funded 
grants to restart such research. 

The groups that published the original 
research have outlined a suite of possible 
follow-up experiments, including a search for 
other combinations of mutations that would 
allow H5N1 to transmit between mam- 
mals — which could answer basic-science 
questions and, they argue, aid efforts to
watch for dangerous mutations in the wild. 
The researchers also suggest extending the 
studies in ferrets to other mammals, such 
as guinea pigs, because further evidence of 
transmission within mammalian species 
would increase confidence that the mutated 
virus would transmit between humans. 

But the scientific community remains 
divided on whether the practical benefits of 
the research outweigh the risks of an acciden- 
tal or deliberate release of a lab-created flu 
strain. Ian Lipkin, a specialist on emerging 
infectious diseases at Columbia University 
in New York, believes that the risks are high 
and, worse, that such research may end up 
being done in labs with insufficient biosafety 
standards. 

The World Health Organization (WHO) 
posted general biosafety guidelines for such 
work on its website last July (go.nature. 
com/4z4yzg), but Lipkin says such guide- 
lines need to be extended and given more 
teeth before work restarts. He suggests that 
this could be done by including them in the 
WHO’s international legally binding treaty on
global threats to health — the 2005 Interna- 
tional Health Regulations. Ron Fouchier at 
Erasmus Medical Centre in Rotterdam, the 
Netherlands, who led the research behind last 
year’s Science paper, disagrees. He says that 
national and institutional procedures have 
long proved adequate. “If we have to wait until 
all national governments in the world agree 
on terms and conditions, we can wait for years 
if not forever,” he says. “That is unacceptable.”

But even some who support the lifting 
of the moratorium have misgivings about 
the future. Ilaria Capua, a flu researcher at 
the Veterinary Public Health Institute in 
Legnaro, Italy, who signed the letter, says that 
she is less concerned about current work, 
which is limited to a handful of labs with 
high biosafety standards, than about the risk 
of proliferation of such research in the longer 
term. “This is not a decision for scientists,” 
she says, “it’s a decision for policy-makers; do 
we want to continue to invest public funds in 
this type of work?” SEE EDITORIAL P.451
DEEP CLEANING


During the Deepwater Horizon oil spill, 
dispersant chemicals were pumped 

near the wellhead 1,500 metres down, 
and sprayed on to the surface slick. 
Researchers debate
oil-spill remedy 


Oil industry maintains that dispersants should be part of
routine response to deep-water blowouts. 


BY MARK SCHROPE 


No aspect of the 2010 Deepwater
Horizon oil spill in the Gulf of Mexico
was more controversial than the decision
to pump massive doses of chemical dispersant
into the oil gushing from 1,500 metres
down (see ‘Deep cleaning’). Advocates said 
that the mixture of solvents and detergent 
would separate the deep oil plume into finer 
droplets, speeding its breakdown. Critics 
feared damage to deep-water ecosystems. 
This week, researchers at the Gulf of Mexico 
Oil Spill and Ecosystem Science Conference 
in New Orleans, Louisiana, are assessing the 
outcome — and sometimes drawing mark- 
edly different conclusions from the scant data. 
Industry scientists argue that the nearly three 
million litres of subsea dispersant worked 
as expected and caused minimal ecological 
damage. Dispersant, they say, should be a 
standard option for fighting future sea-floor 
blowouts. But other researchers say that apply- 
ing dispersants at depth has not yet been 
proved to be effective, let alone safe. 
Both the US National Oceanic and
Atmospheric Administration and industry


representatives have touted aerial photos 
showing that the surface oil plume in the Gulf 
of Mexico diminished after dispersant was 
applied. And oil company BP, which owns the 
well, reported improvements in the air quality 
measured from work ships, suggesting that less 
oil was floating to the surface. 

Seawater samples collected at depth during 
the spill for monitoring by the US Environ- 
mental Protection Agency also suggest that 
the dispersants worked, according to data 
presented by Kenneth Lee, a marine biologist 
at Fisheries and Oceans Canada in Dartmouth. 
He and his team documented droplet sizes that 
are consistent with lab experiments in which 
dispersant and oil are mixed in a wave tank. 

Some post-spill results described at the con- 
ference this week offer support. Eric Adams, 
an engineer at the Massachusetts Institute of 
Technology in Cambridge, and his colleagues 
used glass beads as a proxy for oil in tank 
experiments simulating the spill. Based on 
their results, he says, “If your goal is to spread 
the oil out, then dispersant appears to help.” 

With an eye to the future, the American 
Petroleum Institute, an industry group 
in Washington DC, is studying the most 
efficient means of injecting dispersant into
a deep oil plume, as well as considering 
designs for dispersant applicators that could 
be installed at the wellhead in case of a spill. 
Study-group members are presenting their 
early findings at the conference. “We think 
subsurface dispersant played a critical role 
and that that response effort had a positive 
outcome,” says Emily Kennedy, a policy 
analyst at the institute. 

The oil industry’s confidence leaves some 
researchers wary. “I think it’s incredibly
premature,” says Sean Anderson, an ecologist
at California State University Channel Islands
in Camarillo, who is part of a group studying
the spill. “We're in no way saying dispersants
should never be used or didn’t work at all; it’s
a question of, ‘Show us the data to show it was
actually effective,’” he says.

He and others cite hints that turbulence at 
the wellhead could have caused substantial dis- 
persal on its own. And Claire Paris-Limouzy, 
an oceanographer at the University of Miami 
in Florida, questions some of the positive 
conclusions. Using a computer model, she and 
her colleagues concluded that dispersants may 
have had little effect on the amount of oil that 
ultimately surfaced.

The data are even thinner when it comes to 
the ecosystem effects of the chemicals, or of the 
oil they might have helped to disperse through 
the depths. Biologists conducting the govern- 
ment assessment of effects on deep-sea fish, for 
instance, concluded that they couldn't quantify 
any impacts because of a lack of baseline data 
on these populations. 

In 2010, however, researchers found soft 
corals that had apparently been killed by dispersed
oil from the spill. At the conference,
Charles Fisher, a deep-sea biologist at Penn- 
sylvania State University in University Park, 
reported that the spill damaged at least one 
deep coral stand and possibly two more. His 
collaborators are also reporting substantial 
losses of sea-floor animals, such as worms, 
downstream of the spill. And in shipboard 
tests, Erik Cordes of Temple University in Phil- 
adelphia, Pennsylvania, who collaborates with 
Fisher, showed that an oil-dispersant mixture 
is highly toxic to deep-sea soft corals, which 
can take hundreds of years to grow. 

Cordes admits that dispersant use presents a 
difficult choice: the possibility of faster break- 
down of spilled oil against what could be a 
greater environmental impact from the finely 
dispersed oil. “I don’t know where the trade-off 
lies,” he says. “But my gut and what I’ve seen in 
these experiments tells me I would rather the 
oil just go a little further and last a little longer 
than have the oil and dispersant causing that 
kind of damage.”
International aid projects
come under the microscope 


Clinical-research techniques deployed to assess effectiveness of aid initiatives. 


BY NATASHA GILBERT 


Not long ago, ‘research’ was a dirty word
in international-development circles.
The prevailing view was that the time
and money available should be spent imple- 
menting aid projects rather than analysing their 
effects in detail. For most projects, assessment 
was limited to tracking how much they spent 
and whether they reached their end points. 

That is now beginning to change. In recent 
months, studies have rigorously assessed aid 
projects such as farmer-training efforts and 
intestinal-worm-treatment programmes. These 
studies reflect a more analytical mindset that has 
emerged in the development community over 
the past decade, spurred by the need to assure 
weary donors that their investments are paying 
off. Drawing on methods used in clinical stud- 
ies, the analyses could help to guide policy — 
but they are also raising fears that programmes 
could be axed prematurely if initial results are 
disappointing. 

“As the larger and more careful impact 
studies come out, we will see more and more 
negative results,” says Macartan Humphreys, 
an international-development economist at 
Columbia University in New York. 

The Millennium Challenge Corporation 
(MCC), a US foreign-aid agency, has taken the 
lead in self assessment, committing to using 
scientific methods to analyse the success of 
40% of its projects. Its first assessments — of 
farmer-training activities in five countries 
including Armenia, El Salvador and Ghana — 
delivered a mixed verdict. 

In Bangladesh, women received nutrition advice through an aid programme, but few were able to use it.

Published in October 2012, the evaluations
showed that in three of the countries, efforts to
train farmers in business and agricultural skills
helped them to sell more produce, boosting
farm incomes. But, contrary to the assumption
that greater agricultural production reduces
poverty, there was no evidence that the extra 
cash flowed to the farmers’ households — an 
effect that the MCC cannot readily explain. “We 
are pushing back the boundaries of ignorance by 
doing these studies,” says William Savedoff, an 
economic and social-development researcher at 
the Center for Global Development in Washing- 
ton DC, who was not involved in the evaluation. 
“They are forcing us to grapple with what we do 
and don't know about the links between agricultural
extension and poverty.”

Some of the MCC’s farmer-training assess- 
ments relied on randomized controlled 
trials (RCTs), a mainstay of clinical research. 
In development research, RCTs randomly 
enrol people in aid projects — equipping 
households with bed nets to protect against
disease-carrying mosquitoes, for example 
— and then track them along with an equal 
number of people not benefitting from that 
aid. This protocol allows researchers to eval- 
uate whether a given development strategy 
makes a measurable difference to people’s 
lives. “We think RCTs are very effective but 
still underused,” says development economist
Rachel Glennerster, a director of the Abdul 
Latif Jameel Poverty Action Lab (J-PAL) at 
the Massachusetts Institute of Technology in 
Cambridge. 
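The logic of a randomized controlled trial as described above can be sketched in a few lines. This is a toy simulation with entirely made-up data, not any group's actual protocol: households are split at random into two equal arms, a hypothetical fixed benefit is added to the treated arm, and the effect is estimated as the difference in mean outcomes. The function name, outcome scale and effect size are all assumptions for illustration.

```python
import random
import statistics


def estimate_effect(baselines, true_effect, seed=0):
    """Randomly assign units to equal treatment and control arms and
    return the difference in mean outcomes (treatment minus control)."""
    rng = random.Random(seed)
    shuffled = list(baselines)
    rng.shuffle(shuffled)                      # random assignment
    half = len(shuffled) // 2
    treated = shuffled[:half]
    control = shuffled[half:2 * half]          # equal-sized comparison group
    # Hypothetical outcome model: treatment adds a fixed benefit.
    treated_outcomes = [value + true_effect for value in treated]
    return statistics.mean(treated_outcomes) - statistics.mean(control)


# Made-up baseline outcomes for 2,000 households (arbitrary units).
population_rng = random.Random(42)
households = [population_rng.gauss(100.0, 15.0) for _ in range(2000)]

estimate = estimate_effect(households, true_effect=5.0, seed=1)
print(f"Estimated effect: {estimate:.2f} (simulated true effect: 5.0)")
```

Because assignment is random, the two arms differ on average only in the treatment, so the difference in means recovers the simulated effect up to sampling noise.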

Glennerster says that J-PAL researchers rely 
heavily on RCTs in assessing aid projects, but 
that not everyone regards them as a gold stand- 
ard. Jeffrey Sachs, a sustainable-development 
economist at Columbia University, worries
that RCTs are not an ethical way to assess 
development projects, because they withhold 
aid intervention from control groups. Still, 
enough RCTs have been done for researchers 
to begin systematic reviews of particular inter- 
ventions — but these meta-analyses are also 
attracting criticism. 

Last year, for example, a systematic review 
of programmes to treat children in develop- 
ing countries for intestinal worms found little 
evidence of nutritional, cognitive or educa- 
tional benefit (D. C. Taylor-Robinson et al. 
Cochrane DB Syst. Rev. CD000371; 2012). 
The study was conducted by the Cochrane 
Collaboration, based in Oxford, UK, which is 
best known for its systematic reviews of medical 
treatments. 

A group of prominent development research- 
ers — some of whom, including Glennerster, are 
involved in deworming projects — argued that 
the review omitted or discounted key studies 
that showed benefits to school performance. 
“We were critical of the review because it just 
takes a bunch of studies, averages them and
then finds there is no effect, when actually if you 
look at high-quality [primary] studies you do 
see an impact,” says Glennerster.

David Taylor-Robinson, a population- 
health scientist at the University of Liverpool, 
UK, and lead author of the review, stands by its 


findings. “Our analysis was limited to [RCTs] 
comparing mass administration with placebo 
or no treatment,” he says, adding that three
studies showing positive outcomes did not 
meet these criteria. 

To aid such reviews, the International 
Initiative for Impact Evaluation (3ie), a non- 
profit organization based in Washington 
DC that funds and conducts aid-assessment 
research, is setting up a database in which 
researchers can register studies. Expected to 
launch later this year, the initiative aims eventu- 
ally to provide a complete listing of assessments 
for various types of aid interventions, says How- 
ard White, executive director of 3ie. 

The goal is to help researchers avoid biases,
such as selectively reporting positive results or
excluding negative ones, when conducting systematic
reviews of development projects. It is
not yet clear whether development researchers 
will be required to register studies before pub- 
lishing results in academic journals — as is the 
case for clinical trials in some countries. 

Meanwhile, international-development 
researchers are increasingly applying ‘theory 

of change’ an analyti- 
need to understand the causal mechanisms to
say there is a link between a treatment and an 
outcome,” explains White. “But we would like
to understand more about the causal chain to 
help inform analysis and understand why pro- 
grammes work in some places and not others.” 

In 2005, the World Bank carried out one 
such analysis, of its programme to reduce mal- 
nutrition in Bangladesh. Between 1995 and 
2002, the project taught mothers about nutri- 
tion — for themselves during pregnancy and 
for young children. Falling malnutrition rates 
in programme areas were initially hailed as a 
success, but an evaluation showed that similar 
trends had occurred in control areas, suggesting 
that the programme was not the driving factor. 
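The comparison described above, a change in programme areas set against the change in control areas, is often summarized as a difference-in-differences. The sketch below uses invented malnutrition rates purely for illustration; none of the numbers come from the World Bank evaluation, and the function name is hypothetical.

```python
def difference_in_differences(prog_before, prog_after, ctrl_before, ctrl_after):
    """Change in programme areas minus change in control areas.

    A value near zero suggests the programme added little beyond the
    background trend seen in the control areas.
    """
    programme_change = prog_after - prog_before
    control_change = ctrl_after - ctrl_before
    return programme_change - control_change


# Invented malnutrition rates (% of children), for illustration only.
effect = difference_in_differences(
    prog_before=60.0, prog_after=52.0,   # programme areas fall 8 points
    ctrl_before=58.0, ctrl_after=50.5,   # control areas fall 7.5 points
)
print(f"Change attributable to the programme: {effect:+.1f} percentage points")
```

In this made-up case the programme areas improve by 8 points, but the control areas improve by 7.5, leaving only half a point that could be credited to the programme.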

The analysis found that one of the main 
reasons the programme failed to make a 
difference was that fathers tended to be in charge 
of what food entered their homes — so mothers 
were unable to implement the nutritional edu- 
cation they received in the programme. 

Depressing as such discoveries might be, 
they are part of an important culture change in 
development circles, says Humphreys. Nega- 
tive results are integral to the research process, 
he argues, and it is important for researchers 
and donors to become more tolerant of them. 
If they do not, “there is a fear that when people
see negative results, they will stop funding and 
pull out of research altogether”.
BLOT OUT THE LIGHT


Seeing an Earth-like planet orbiting a star 10 billion times brighter has been 
likened to trying to see a firefly next to a lantern from 10 kilometres away. 
To block the starlight, astronomers are developing five types of coronagraph: 


BAND LIMITED 

Using a mask that resembles a barcode, this 
coronagraph suppresses starlight along the telescope’s 
optical axis while transmitting light from off-axis planets. 


Key challenge: 
Precision manufacture of the barcode-like mask. 


VECTOR VORTEX 


A circular mask rotates the angle of polarization of 
photons from the star, so they cancel each other out. 


Key challenge:
The need to manufacture the fine circular pattern of the
mask, made from liquid-crystal polymers, to a precision
of micrometres.


VISIBLE NULLING

Akin to flying several telescopes in formation, a device
installed in a single spacecraft generates an interference
pattern that suppresses starlight.

Key challenge:
Extending the technique to a broad band of
wavelengths.


PHASE-INDUCED AMPLITUDE APODIZATION

Two asymmetrical mirrors reshape the incoming light
beam to shunt starlight off to the side and preserve
planetary light.

Key challenge:
The manufacture of correctly shaped optics.


STARSHADE

A petal-shaped structure carried into space on a
separate spacecraft shades the telescope optics from
the glare of the star.

Key challenge:
Telescope and shade must fly in formation, separated
by tens of thousands of kilometres.


Fresh bid to see exo-Earths 


Improved instruments and a telescope windfall could aid the search for extrasolar life. 


BY RON COWEN 


As NASA’s Kepler spacecraft finds signs 
that the Galaxy is teeming with Earth-sized 
planets, astronomers are renewing 
a stalled quest: to gather light directly from 
an Earth twin and tease it apart for the chemi- 
cal signatures of life. That is not possible with 
current techniques, which find planets indi- 
rectly; and in 2006, NASA cancelled a mis- 
sion that might have had a chance at doing 
so. The Terrestrial Planet Finder (TPF) would 
have used either an array of small telescopes or 
one giant, 8-metre mirror to capture planetary 
light. Now a new generation of instruments is 
raising hopes that a smaller, cheaper space tele- 
scope could do the job. 

The TPF cancellation was “a big blow”, says 
Olivier Guyon, an astronomer at the University 
of Arizona in Tucson who also works on Japan's 
Subaru Telescope in Hawaii, “but it also felt like 
a challenge to see what we could do with smaller 
telescopes”. Guyon and others are hoping to 
meet that challenge with advanced corona- 
graphs: telescope devices that mask starlight 
like an artificial eclipse, allowing nearby planets 
that would otherwise be obscured by the star's 
glare to be seen (see ‘Blot out the light’). NASA 
is exploring the idea of putting a coronagraph 
on one of a pair of 2.4-metre space telescopes 
donated to the agency by the National Reconnaissance 
Office, which operates the US fleet 
of spy satellites (see Nature 490, 16-17; 2012). 
Although a telescope that size — the same as 
the Hubble Space Telescope — would be hard- 
pressed to gather light from Earth-sized planets, 
it could image and take chemical spectra from 
planets the size of Jupiter and possibly even 
smaller than Neptune. 

The impetus for such a mission is growing. 
This month, at a meeting of the American 
Astronomical Society in 
Long Beach, California, 
astronomers estimated 
the number of planets in 
the Milky Way, extrapo- 
lated from the number 
and types of planet that 
Kepler has already found 
in one small part of the Galaxy. The result: at 
least 100 billion extrasolar planets, one for every 
star. Even more tantalizingly, a potentially hab- 
itable Earth-sized orb is likely to reside within 
6 parsecs (20 light years) of the Solar System. 
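
As a rough check on these figures, the unit conversion and the apparent star-planet separation can be worked through in a few lines. This is an illustrative sketch rather than a calculation from the Kepler analysis: the 1-AU orbit, the 550-nanometre observing wavelength and the use of the Rayleigh criterion (1.22 λ/D) for a 2.4-metre mirror are all assumptions chosen for the example.

```python
import math

LY_PER_PARSEC = 3.2616  # light years in one parsec (standard conversion)
ARCSEC_PER_RADIAN = 180 * 3600 / math.pi


def parsecs_to_light_years(distance_pc):
    """Convert a distance in parsecs to light years."""
    return distance_pc * LY_PER_PARSEC


def separation_arcsec(orbit_au, distance_pc):
    """Angular star-planet separation in arcseconds.

    By definition of the parsec, 1 AU seen from 1 pc subtends 1 arcsecond.
    """
    return orbit_au / distance_pc


def diffraction_limit_arcsec(wavelength_m, aperture_m):
    """Rayleigh resolution limit of a circular aperture (an assumed criterion)."""
    return 1.22 * wavelength_m / aperture_m * ARCSEC_PER_RADIAN


distance_pc = 6.0  # distance quoted in the text
print(round(parsecs_to_light_years(distance_pc), 1))    # 19.6
print(round(separation_arcsec(1.0, distance_pc), 3))    # 0.167
print(round(diffraction_limit_arcsec(550e-9, 2.4), 3))  # 0.058
```

Under these assumptions an Earth twin at 6 parsecs would sit roughly three resolution elements from its star, which suggests that the difficulty lies less in separating the two than in suppressing starlight that is billions of times brighter.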

Yet NASA, strapped for funds and burdened 
with the US$8-billion James Webb Space Tele- 
scope project, is not about to resurrect the TPF. 
Planet hunters know that a smaller, pathfinder 
mission is their best chance. “This is the only 
show in town,” says Alan Boss, an astronomer 
at the Carnegie Institution for Science in 
Washington DC. 

A NASA-appointed team of astronomers, 
the Astrophysics Focused Telescope Assets 
committee, will on 30 April deliver a report on 
the feasibility of using the donated telescopes 
for several types of observation, including 
planet imaging. Any planet hunting would 
have to be squeezed into a jam-packed opera- 
tions schedule: astrophysicists are also eager 
to use the telescopes to study the mysterious 
phenomenon known as dark energy, which 
is accelerating the expansion of the Universe. 

The structure of the instruments could also 
be a problem. The 2.4-metre telescopes contain 
two mirrors; the secondary one is held by six 
struts that cause light scattering, says Wes Traub, 
chief scientist for NASA’s exoplanet exploration 
programme at the Jet Propulsion Laboratory in 
Pasadena, California. “The struts couldn't be in 
a worse place for imaging planets,” he says. 

Even so, exoplanet astronomers are enthu- 
siastic. Coronagraphs have improved vastly 
in recent years: just a decade ago, says Guyon, 
they were good at blotting out most of the 
star’s central light, but also threw away about 
90% of the light reflected from the planet. One 
problem was caused by the sharp edges of the 
telescope mirror, which created a ring-like 
diffraction pattern in the region of the image 
where the planet resides. 

Guyon’s coronagraph 
uses two asymmetri- 
cally polished mirrors to 
reshape the distribution 
of light gathered by the 
telescope’s main mirror and eliminate the dif- 
fraction pattern. The design, which in 2012 won 
him a ‘genius’ grant from the MacArthur Foun- 
dation, allows the starlight to be thrown away 
without sacrificing light from the planet. Guyon 
says that his device could enable a 4-metre telescope, 
half the diameter of the TPF, to see an 
Earth twin — but whether it could enable a 
2.4-metre telescope to find Earths remains to 
be seen. “We're still in the grey zone,” he says. 


Dust adds to the uncertainty. Asteroid colli- 
sions have filled the inner Solar System with a 
haze of dust that reflects sunlight, creating what 
is known as zodiacal light. The worst nightmare 
of planet hunters is the possibility that the dust 
in other solar systems could generate so much 
‘exozodiacal’ light that Earth-sized orbs would 
be lost within it. “This is the main non-techno- 
logical risk for an exo-Earth imaging mission,” 
says Ruslan Belikov, an astrophysicist at the 
NASA Ames Research Center in Moffett Field, 
California. Measuring the levels of exozodiacal 
light around Sun-like stars is a key goal for a 
pathfinder mission, he adds. 

Even if NASA decides against putting a 
coronagraph on one of the donated telescopes, 
says Guyon, he is not about to give up. Find- 
ing out if there are other habitable planets in 
the Galaxy, he says, “is the most interesting 
question in all of science”. 


Japan’s stimulus package 
showers science with cash 


But new leadership’s largesse brings expectations of fast commercial pay-offs. 


BY DAVID CYRANOSKI IN TOKYO 


Three years ago, the picture for research 
funding in Japan looked bleak. As part 
of efforts to slash the national budget 
by ¥3 trillion (US$33.5 billion), the government, 
led by the Democratic Party of Japan, 
had proposed sweeping cuts to science, spark- 
ing protests from the country’s most eminent 
researchers. Japan's flagship K supercomputer 
project narrowly escaped being shut down 
after auditors questioned whether Japan 
needed to host the world’s fastest computer. 

Fast-forward to 2013, and Shinzo Abe, head 
of the newly elected Liberal Democratic Party- 
led government, seems to have no such doubts. 
“Of course we must aim for number one,” he 
told reporters after a tour of the supercomputer 
facility on 11 January. 

Science is a big winner in the government’s 
massive ¥10.3-trillion economic stimulus pack- 
age, approved by the cabinet on 15 January. The 
stimulus flags up the new leadership's determi- 
nation that research should pull its weight in 
dragging Japan’s economy out of recession. 
The supplementary budget provides billions 
of yen for key fields and significant boosts for 
many big scientific facilities (see ‘Big winners’) 
— including ¥8.4 billion for data links between 
the K supercomputer and Japan’s universities. 

Among the many other winners is the 
SPring-8 synchrotron, which will get ¥2.9 bil- 
lion to spruce up its ageing beamlines, used for 
structural studies of materials and proteins. 
There is also ¥16.6 billion for the interna- 
tional fusion energy project ITER, which has 
been beset by budgetary difficulties since its 
inception (see Nature 487, 420; 2012). Mean- 
while, the Japanese space agency JAXA will get 
¥22.9 billion to refurbish its facilities and to 
speed up development of ALOS-2, a satellite 
that will monitor natural disasters and measure 
atmospheric carbon dioxide levels. The extra 
cash will keep it on schedule for launch before 
April next year. 

But the biggest beneficiary of the Abe gov- 
ernment’s stimulus is stem-cell research, espe- 
cially that geared towards clinical applications. 
The science ministry alone has earmarked 
¥21.4 billion for research on stem cells, mainly 
focused on induced pluripotent stem (iPS) cells 
— reprogrammed adult cells, first developed 
in Japan. 

Yoshiki Sasai, who has been wowing biolo- 
gists and non-scientists alike by growing rudi- 
mentary retinas, brain parts and other tissues 
from stem cells (see Nature 488, 444-446; 
2012), has long been negotiating with the gov- 
ernment for facilities to link basic research at 
the Center for Developmental Biology in Kobe, 
where he works, with clinics and industry. The 
stimulus package, nonetheless, came “like 
thunder, out of the blue”, he says. The budget 
includes ¥3.8 billion for a new building at Sasai’s 
centre, with planned projects being largely 
collaborative ventures with Japanese businesses. 

BIG WINNERS 
Some of the facilities or fields that receive big 
funding boosts in Japan's supplementary budget. 

An even bigger windfall will benefit work 
by Shinya Yamanaka, a Nobel laureate who 
developed the first recipe for iPS cells. The 
Center for iPS Cell Research and Application at 
Kyoto University, which he directs, is to receive 
¥4 billion for a building to house research on 
reprogramming mechanisms and clinical 
applications of iPS cells. Another ¥700 million 
will pay to create a cell-processing centre at 
the Foundation for Biomedical Research and 
Innovation in Kobe, mainly to support Masayo 
Takahashi, who works next door at the Center 
for Developmental Biology. Takahashi is plan- 
ning the first trial of iPS cells in humans, in a 
clinical study set to start this year on the eye 
condition known as macular degeneration. 

Other ministries have also jumped on the 
iPS-cell bandwagon. For example, with ¥2.2 bil- 
lion the health ministry plans to build two 
centres to provide training on deriving and 
cultivating iPS cells. 

The stimulus cash could be just the start of 
the good times for Japan's stem-cell researchers. 
The government is expected to announce a ten- 
year, ¥90-billion iPS-cell initiative in next year’s 
budget, beginning in April. 

But the largesse comes with expectations. 
Of the ¥721 billion earmarked for science and 
technology by the science ministry, ¥180 bil- 
lion will go towards translating university 
research into commercial applications, and 
most other projects are framed around clinical 
or industrial applications. With the stimulus, 
says Yamanaka, the government is “telling us to 
rush iPS-cell-related technologies to patients as 
quickly as possible”. 
The transparency of 
zebrafish larvae makes the 
animals ideal models for 
seeing deep into tissues. 


FISH-BOWL 
NEUROSCIENCE 


Tiny fish trapped in a virtual world provide a 
window into complex brain connections. 


BY VIRGINIA HUGHES 


A recently hatched zebrafish is swimming upriver for 
the first time. Its big round eyes, bulging on the front 
of its eyelash-sized body, scan the surroundings. Sud- 
denly, it sees the scenery flying forwards as a gentle 
current pushes it backwards. The fish flicks its tail to 
try to stay in place. Or so it thinks. 

In reality, the baby fish is paralysed and suspended in a water-filled 
Petri dish by glass pipettes. The dish sits on the stage of a US$100,000 
microscope in the corner of a darkened, cluttered laboratory. A film, 
projected from below, has transported the fish to a virtual world in which 
moving bands of light and dark simulate passing underwater scenery. 

Although the fish doesn’t move, the motor neurons that control its 
tail are firing away, just as if it were swimming. 

And when fed into a computer, those signals can 
control the video display, giving the fish nearly 
every sign that it is swimming normally. All the 
while, Florian Engert’s microscope peers deep 
into the fish’s tiny, translucent brain to watch 
neurons glow green as they fire. 

NATURE.COM 
Watch a video of a zebrafish in ‘the matrix’ at go.nature.com/Ighpmp 

Engert, the neuroscientist who developed the set-up, often jokes that 
the fish is just like Neo, a leading character from the 1999 sci-fi thriller 
The Matrix, in which humans have been enslaved by machines but are 
fed a virtual reality that leads them to believe that they are free. Engert’s 
team at Harvard University in Cambridge, Massachusetts, hopes that 
the fish in this aquatic matrix will help to answer the biggest question in 
neuroscience: how a doughy mass of neurons in the brain gives rise to an 
exquisite suite of behaviours, absorbing information from the outside 
world and generating responses. 

Since the late nineteenth century, when Spanish anatomist Santiago 
Ramón y Cajal pinpointed the neuron as the fundamental unit of the 
brain, most neuroscientists have focused on recording the electrical 
buzzing of individual cells. That has tended to mean sticking electrodes 
into the brains of cats, rabbits, rats, mice, sea slugs, squid, monkeys and 
even people. The approach reveals a lot about how neurons respond 
to inputs — such as a chemical 
messenger, a sound or a colour 
— and produce individual fir- 
ing patterns, which the brain 
decodes to drive behaviour. But 
how these cells work together to 
translate and integrate complex, 
real-world sensory inputs — 
such as moving scenery, smells, 
sounds or an approaching predator 
— “is something that’s still a 
big mystery”, Engert says. “That's 
probably the main challenge for 
the next decade.” 

Larval zebrafish (Danio rerio) 
have been a workhorse model 
organism in developmental 
biology labs for about 30 years 
because they are cheap and 
relatively amenable to genetic 
manipulation, and have transpar- 
ent tissues, allowing researchers 
to see inside them. Engert and 
a small group of neuroscientists 
have been looking to capitalize 
on these qualities to study how 
the brain encodes vision, hearing, 
movement and even fear, things 
that are impossible to do in the 
brains of more complex model 
organisms. And techniques such 
as Engert’s matrix are allowing 
them to monitor the zebrafish’s 
300,000 or so neurons and to track activity in vast swathes of them simul- 
taneously in living brains. Such innovations mean that grant reviewers 
and top-tier journals — which overwhelmingly favour neuroscience 
research in mammals — are now giving fish a chance. 

“There is a sort of perfect convergence of model and methodology 
with zebrafish that is peaking right about now,” says Joseph Fetcho, a 
neurobiologist at Cornell University in Ithaca, New York, who pio- 
neered brain-circuit research in the fish. This impressive array of tools, 
he adds, makes him wonder “why one would use any other model for 
basic questions about circuits and behaviour”. 


A RIVER OF DECEIT 

A zebrafish larva, paralysed and suspended 
by pipettes, can be fooled into believing it is 
caught in a gentle current. All the while, a 
microscope monitors its brain reactions. 

Figure labels: Microscope. Electrodes monitor bundles of motor neurons in the fish’s tail. Moving bands of light and dark simulate a riverbed sliding forward. Projector. 


SPOTLIGHT AQUARIUM 
During his early career at Stony Brook University in New York, Fetcho 
worked on goldfish brains until he got frustrated recording from only 
a couple of cells at a time. He switched to zebrafish in the mid-1990s, 
after two strikes of serendipity. 

First, at a zoology conference, Fetcho stumbled into a workshop on 
using zebrafish in high-school biology classes. He realized how easy 
it was to watch their translucent embryonic cells divide and, over the 
course of just a few days, develop into organs and limbs. 

Second, he came across a paper describing how to fill neurons with a 
green dye that is sensitive to calcium. Because neuronal firing requires 
an influx of calcium ions, this method provided a way to view cells in 
action. The paper used cells isolated from chick spinal cords, but Fetcho 
thought the same approach could light up the neurons of live zebrafish. 
He went to a local pet shop and bought a mating pair. The next day, he 
had fertilized eggs to experiment with. 

In Fetcho’s first zebrafish paper, published in 1995, he and his 
colleague Donald O’Malley used the calcium-sensitive green dye to 
track the activity of motor neurons during a predator-escape reflex, 
which is triggered by poking the fish in the head. His team went on to 
show that neurons in different segments of the hindbrain encode how 
the fish turns its body to escape from a predator. The researchers then 
created the first transgenic line 
of fish to express a calcium indicator, 
similar to the green dye, 
in all neurons, so that the dye no 
longer had to be injected. 

Neuroscientists also started 
to use calcium indicators to 
label neural circuits in other 
animals. In a landmark study in 
2001, for example, researchers 
mounted miniature two-photon 
microscopes — which can probe 
more than a millimetre deep 
into tissues — on the heads of 
scampering rats to reveal the fir- 
ing patterns of many individual 
neurons at once. Another group 
observed the brain of a fruitfly 
that was secured under a microscope 
while its legs walked freely 
on top of a polystyrene ball. 

But when using rodents or 
flies, researchers must first cut 
windows in the animals’ heads 
to expose the part of the brain 
they want to image. And even 
then, the microscopes can probe 
only superficial layers of these 
opaque brain tissues. 

Only two popular model 
organisms have small, trans- 
parent brains and can be easily 
genetically engineered: the lar- 
val zebrafish and the soil nematode Caenorhabditis elegans, which has 
just 302 neurons. But to zebrafish researchers, C. elegans is too small 
and its circuits are too simple. Its brain is also covered in a tight cuticle 
and its neurons are tiny, making it difficult to make recordings with 
conventional electrodes. Besides, the worms don't show the same variety 
of behaviours — notably, those that require sophisticated vision — that 
a fish does. “We could probably do similar things in C. elegans,” says 
Engert, “but I'd be worried that the answers aren't as interesting.” 

When Engert launched his Harvard lab in January 2002, he was 
determined to focus on the circuitry of larval zebrafish. His former 
adviser’s father had been close friends with Nobel-prizewinning developmental 
biologist Christiane Nüsslein-Volhard, who helped to pioneer 
the zebrafish as a model for embryonic development. Still, Engert had 
never seen one in the flesh. “I was quite shocked when I saw one for the 
first time; they're so small,” he says. 

It was an audacious choice, but it fit Engert’s personality and pen- 
chant for risk. He’s known around the Harvard biology building for never 
wearing shirtsleeves, rollerblading into lectures and riding his motorbike 
without a helmet. A native German, he did his Harvard tenure talk in 
lederhosen (he earned tenure in 2009). Last year, he was nearly trapped 
under an avalanche while skiing off piste in Austria — shirtless. 

At Harvard, it took Engert a couple of years to set up his lab experi- 
ments for zebrafish. At first, he tried imaging neurons while the fish were 
swimming freely. But “their whole brain wiggles”, he says, making imaging 
impossible. That’s when he started to build the virtual environment. 

For a study published last May, Engert’s postdocs Ruben Portugues 
and Misha Ahrens built a simple virtual world consisting of red and 
black stripes that moved under the fish. This visual stimulus, crude as 
it is, was enough to make the animals feel as though they were being 
swept backwards by a rushing river, and send out muscle commands 
to push forwards. 

With the stroke of a few computer keys, the researchers can manipulate 
the scene, making the stripes go slower or faster. The tweaks make it 
seem, to the fish, that its movements have been either too weak or too 
strong, so it makes adjustments to stay in place. This behaviour is called 
motor adaptation, and it is akin to 
what people do when they’re walk- 
ing and suddenly slide on a patch 
of ice, for instance. The brain takes 
in the new environmental infor- 
mation and adjusts movements to 
prevent a fall. 
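
The closed-loop logic described here can be sketched in a few lines. This is a toy model and not the Engert lab's software: the function names, the constant `drift` speed, the `gain` values and the simple proportional update rule are all assumptions made for illustration.

```python
def stripe_velocity(drift, gain, motor_drive):
    """Speed at which the stripes appear to slide forward past the fish.

    drift: speed of the simulated current pushing the fish backwards.
    gain: how far each unit of motor drive moves the fish in the virtual world.
    """
    return drift - gain * motor_drive


def adapt(drift, gain, motor_drive=0.0, rate=0.5, steps=50):
    """Adjust motor drive until the scenery appears to stand still."""
    for _ in range(steps):
        slip = stripe_velocity(drift, gain, motor_drive)
        # Scenery still sliding forward: swim harder. Sliding back: ease off.
        motor_drive += rate * slip / gain
    return motor_drive


low_gain_drive = adapt(drift=1.0, gain=0.5)   # each tail flick achieves little
high_gain_drive = adapt(drift=1.0, gain=2.0)  # each tail flick achieves a lot
print(low_gain_drive > high_gain_drive)  # True
```

In this sketch, lowering the gain makes the simulated fish settle on a stronger motor drive and raising it makes the fish ease off, which is the qualitative pattern that the text calls motor adaptation.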

Studies in monkeys had 
revealed that specific neuronal 
populations were involved in 
motor adaptation. “If something 
happens that’s unexpected, it 
needs to get processed completely 
differently from if it’s expected,” 
Engert says. The processing 
“either tells me that something is 
happening in the world outside, 
independent of my motion, or it 
tells me there's something wrong 
with my body”. 

His fish study implicated the 
same populations of cells, but 
also showed something new. Cer- 
tain neurons within these popula- 
tions encode high-feedback gains 
— that is, visual feedback telling 
the fish that its muscles are stronger than expected — whereas others 
respond to feedback that its muscles are weaker than expected. 

This is the sort of nitty-gritty detail that neuroscientists, hoping to 
look at the individual neurons within circuits, relish. In the past few 
years, other labs have revealed similar details. In 2010, for example, 
researchers in Japan pinpointed specific neurons in the habenula — a 
deep brain region that is difficult to study in mammals — as crucial 
players in the zebrafish’s response to fear. In 2011, Fetcho showed that 
neurons in the fish's hindbrain are neatly stacked during development 
in a way that allows the oldest cells to drive the fastest movements and 
younger cells to control more refined motions. 

But researchers are less excited by Engert’s results than by his 
technology, which allows them to view every neuron in an entire living, 
working brain. “You can't do it in any other animal,” says Martin Meyer, 
a neuroscientist at King’s College London who used calcium imaging 
to show how different layers of cells in the zebrafish brain respond to 
objects moving in specific directions. “There’s more or less endless 
scope, once you have that set up.” 

Neuroscientists who work on other animals also applaud Engert’s 
technique, although they have some reservations. “It doesn’t give you 
everything that there is to know,” says Rex Kerr, who studies nema- 
tode brains at the Howard Hughes Medical Institute’s Janelia Farm 
Research Campus in Ashburn, Virginia. 

Kerr notes that the two-photon microscope can't actually image all 
300,000 neurons at once. Instead, Engert’s group systematically moni- 
tors 1,000 or so neurons in 300 subregions, from a total of 32 fish, and 
then uses a computer model to merge the activity onto a reference brain. 
For some behaviours, this averaged neural activity can mask interesting 
activity patterns in individual neurons, Kerr says. Still, a lot of interest- 
ing activity is determined by populations of many cells. Zebrafish are 
“extremely valuable” for looking at those ensembles, he says. 
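
Kerr's caveat about averaging can be illustrated with a toy example. The two activity traces below are invented for illustration and are not data from any study: they simply show how two neurons with opposite firing patterns can average to a flat signal.

```python
# Hypothetical activity of two neurons over four time steps (made-up values).
neuron_a = [1.0, 0.0, 1.0, 0.0]  # fires on the first and third steps
neuron_b = [0.0, 1.0, 0.0, 1.0]  # fires on the second and fourth steps

# Averaging the two traces step by step hides both alternating patterns.
averaged = [(a + b) / 2 for a, b in zip(neuron_a, neuron_b)]
print(averaged)  # [0.5, 0.5, 0.5, 0.5]
```

The averaged trace still reports the overall level of activity in the pair, which is consistent with Kerr's point that population-level measurements remain informative.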


FISH FUTURES 

There are practical reasons why more neuroscientists haven't worked 
on zebrafish. Many sophisticated behaviours — such as communica- 
tion, social interaction and complex emotions — aren't displayed by the 
animal at all. And scientists have yet to develop ways to study even some 
basic reflexes in the fish. 

That means it is not yet clear what other behaviours Engert’s technique 
will be able to test. Jason Rihel, who is setting up a lab at University Col- 
lege London, would like to use a 
similar approach to study neurons 
that produce hypocretin, which are 
involved in sleep and wakefulness. 
“If we could watch the whole brain 
while we're tickling the hypocretin 
neurons, or inhibiting them, then 
we might be able to map out every 
neuron in the brain that has altered 
activity,” Rihel says. He is a bit worried, 
however, that the immobilization 
will affect the fishes’ slumber. 

Engert, in typical fashion, has 
ambitious plans. He's got each of 
his five postdocs and about eight 
graduate students working on a 
different zebrafish experiment, 
from uncomfortably warm baths 
that test fear-learning to alcohol- 
rich waters to look at the effects of 
positive rewards. 

He is also working on a “side 
project” that is likely to get a lot 
of attention later this year: the 
zebrafish connectome. Engert’s 
team is doing whole-brain func- 
tional imaging of live, baby fish while they look at moving bars, and 
then passing those brains on to Harvard colleague Jeff Lichtman, who 
will use an electron microscope to trace the anatomical connections. 
“We'll have a wiring diagram of the whole brain that can relate structure 
to function,” Engert says. 

With these types of resource, Engert and his zebrafish might even find 
what neuroscientists have been searching for since Cajal: a fundamental 
principle that describes how circuits interact with one another. “My life 
will not be a failure if it doesn’t happen, but I'd love to find it,” Engert says. 
“And it’s ten times more likely to happen in fish than mice.” 


Virginia Hughes is a freelance science writer in New York City. 
The ritual animal 


Praying, fighting, dancing, chanting — human rituals could illuminate 
the growth of community and the origins of civilization. 


BY DAN JONES 


By July 2011, when Brian McQuinn 
made the 18-hour boat trip from 
Malta to the Libyan port of Misrata, 
the bloody uprising against Libyan 
dictator Muammar Gaddafi had already been 
under way for five months. 

“The whole city was under siege, with 
Gaddafi forces on all sides,” recalls Canadian- 
born McQuinn. He was no stranger to such 
situations, having spent the previous decade 
working for peace-building organizations in 
countries including Rwanda and Bosnia. But 
this time, as a doctoral student in anthropology 
at the University of Oxford, UK, he was taking 
the risk for the sake of research. His plan was 
to make contact with rebel groups and travel 
with them as they fought, studying how they 
used ritual to create solidarity and loyalty amid 
constant violence. 

It worked: McQuinn stayed with the rebels 
for seven months, compiling a strikingly 
close and personal case study of how rituals 
evolved through combat and eventual victory. 
And his work was just one part of a much 
bigger project: a £3.2-million (US$5-million) 
investigation into ritual, community and 
conflict, which is funded until 2016 by the 
UK Economic and Social Research Council 
(ESRC) and headed by McQuinn’s supervisor, 
Oxford anthropologist Harvey Whitehouse. 

Rituals are a human universal — “the glue 
that holds social groups together”, explains 
Whitehouse, who leads the team of anthro- 
pologists, psychologists, historians, econo- 
mists and archaeologists from 12 universities 
in the United Kingdom, the United States and 
Canada. Rituals can vary enormously, from the 
recitation of prayers in church, to the some- 
times violent and humiliating initiations of 
US college fraternity pledges, to the bleeding 
of a young man’s penis with bamboo razors 
and pig incisors in purity rituals among the 
Ilahita Arapesh of New Guinea. But beneath 
that diversity, Whitehouse believes, rituals are 
always about building community — which 
arguably makes them central to understand- 
ing how civilization itself began. 

To explore these possibilities, and to 
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tease apart how this social glue works, 
Whitehouse’s project will combine fieldwork 
such as McQuinn’s with archaeological digs 
and laboratory studies around the world, from 
Vancouver, Canada, to the island archipelago of 
Vanuatu in the south Pacific Ocean. “This is the 
most wide-ranging scientific project on rituals 
attempted to date,” says Scott Atran, director 
of anthropological research at the CNRS, the 
French national research organization, in Paris, 
and an adviser to the project. 


Human rites 

A major aim of the investigation is to test 
Whitehouse’s theory that rituals come in 
two broad types, which have different effects 
on group bonding. Routine actions such as 
prayers at church, mosque or synagogue, or 
the daily pledge of allegiance recited in many 
US elementary schools, 
are rituals operating in 
what Whitehouse calls 
the ‘doctrinal mode’. 
He argues that these 
rituals, which are easily 
transmitted to children and strangers, are well 
suited to forging religions, tribes, cities and 
nations — broad-based communities that do 
not depend on face-to-face contact. 

Rare, traumatic activities such as beating, 
scarring or self-mutilation, by contrast, are 
rituals operating in what Whitehouse calls 
the ‘imagistic mode’. “Traumatic rituals cre- 
ate strong bonds among those who experience 
them together,” he says, which makes them 
especially suited to creating small, intensely 
committed groups such as cults, military 
platoons or terrorist cells. “With the imagis- 
tic mode, we never find groups of the same 
kind of scale, uniformity, centralization 
or hierarchical structure that typifies the 
doctrinal mode,” he says. 


Rebel yell 

Whitehouse has been developing this theory 
of ‘divergent modes of ritual and religion’ 
since the late 1980s, based on his field work in 
Papua New Guinea and elsewhere. His ideas 
have attracted the attention of psychologists, 
archaeologists and historians. 

Until recently, however, the theory was 
largely based on selected ethnographic and 
historical case studies, leaving it open to the 
charge of cherry-picking. The current ritu- 
als project is an effort by Whitehouse and his 
colleagues to answer that charge with deeper, 
more systematic data. 

The pursuit of such data sent McQuinn 
to Libya. His strategy was to look at how the 
defining features of the imagistic and doctrinal 
modes — emotionally intense experiences 
shared among a small number of people, compared 
with routine, daily practices that large 
numbers of people engage in — fed into the 
evolution of rebel fighting groups from small 
bands to large brigades. 

At first, says McQuinn, neighbourhood 
friends formed small groups comprising 
“the number of people you could fit in a car”. 
Later, fighters began living together in groups 
of 25-40 in disused buildings and the man- 
sions of rich supporters. Finally, after Gaddafi’s 
forces were pushed out of Misrata, much larger 
and hierarchically organized brigades emerged 
that patrolled long stretches of the defensive 
border of the city. There was even a Misratan 
Union of Revolutionaries, which by November 
2011 had registered 236 rebel brigades. 

McQuinn interviewed more than 300 fight- 
ers from 21 of these rebel groups, which varied 
in size from 12 to just over 1,000 members. He 
found that the early, smaller brigades tended 
to form around pre-existing personal ties, and 
became more cohesive and the members more 
committed to each other as they collectively 
experienced the fear and excitement of fighting 
a civil war on the streets of Misrata. 

But six of the groups evolved into super-brigades
of more than 750 fighters, becoming
“something more like a corporate entity
with their own organizational rituals”, says
McQuinn. A number of the group leaders had
run successful businesses, and would bring
everyone together each day for collective
training, briefings and to reiterate their moral
codes of conduct — the kinds of routine group
activities characteristic of the doctrinal mode.
“These daily practices moved people from
being ‘our little group’ to ‘everyone training
here is part of our group’,” says McQuinn.

FORGING LOYALTY
Members of large, stable groups such as religions, tribes and nations typically reinforce their commitment with routine rituals such as Buddhist prayers in Thailand (facing page). But members of small, committed groups such as the rebel cells that gathered in Benghazi, Libya, in March 2011 (this page) are often initiated by rituals or other experiences that are frightening and traumatic.

McQuinn and Whitehouse’s work with 
Libyan fighters underscores how small groups 
can be tightly fused by the shared trauma of 
war, just as imagistic rituals induce terror to 
achieve the same effect. Whitehouse says that 
he is finding the same thing in as-yet-unpub- 
lished studies of the scary, painful and humili- 
ating ‘hazing’ rituals of fraternity and sorority 
houses on US campuses, as well as in surveys of 
Vietnam veterans showing how shared trauma 
shaped loyalty to their fellow soldiers. 

To gain a more global perspective on ritual 
practices, Whitehouse and Quentin Atkinson, 
a psychologist at the University of Auckland, 
New Zealand, and a member of the project, 
used a previously developed database contain- 
ing information on world cultures to explore 
the connections between frequency, peak 
levels of emotional arousal, and average community
size for 645 rituals across 74 cultures.
As predicted, the rituals fell into two clusters: 
low-frequency but high-arousal imagistic vari- 
eties that were more common in societies with 
a smaller average community size, and high- 
frequency, low-arousal doctrinal rituals that 
were more established in societies in which
communities are larger. 

Given these data from contemporary 
cultures, it is hard not to speculate about ritual’s 
role in history: did the transition from imagistic 
mode to doctrinal mode, with its emphasis on 
a common identity buttressed by daily activi- 
ties and rituals, play a part in the emergence of 
large, complex societies 10,000 years ago? 


The birth of civilization? 

To address that question, Whitehouse, Atkinson 
and Camilla Mazzucato, also based at the 
University of Oxford, are looking at archaeological
data from Çatalhöyük, one of the
largest and best-preserved Neolithic towns
known. Located in the Anatolian plains of
south-central Turkey, Çatalhöyük was founded
during the dawn of agriculture roughly 9,500 
years ago, and housed more than 8,000 people 
at its peak. 

The town’s early layers show that residents
frequently buried their kin under the floors 
of their houses, sometimes with their heads 
severed. Wall paintings also depict the town’s 
residents getting together to tease and kill 
enormous wild bulls for feasting. “The whole 
process of baiting and killing these animals 
would have been extremely intense, and have 
had a major emotional impact,” says excavation
director Ian Hodder, an archaeologist at Stan- 
ford University in California. These occasional 
feasts were also memorialized by mounting 
the skulls and horns of bulls inside houses, 
and burying the rest of the bones to com- 
memorate the founding or abandonment of a 
house, which Hodder says were also highly 
ritualistic events. 

Evidence for such imagistic-style rituals 
declines in the later layers of Çatalhöyük. Wild-bull
rituals and bull-horn installations become
less common as the herding of domesticated 
sheep, goats and cattle intensified, says Hod- 
der. Human burials within houses fade out, 
and standardized symbolic artefacts, such as 
painted pottery and seal stamps, become more 
common. Whitehouse and Hodder believe 
that these changes represent a shift to a more 
doctrinal mode of ritual as people united into 
a larger, more cooperative community devoted 
to agriculture and animal herding. Although 
speculative, this interpretation is consistent with 
Whitehouse and Atkinson’s cross-cultural sur- 
vey, which found that in contemporary societies 
the doctrinal mode is more established where 
agriculture is practised most intensively. 

Looking beyond Çatalhöyük, Whitehouse,
Atkinson and Mazzucato are building a regional 
database chronicling similar changes in ritual at 
60 other sites across the Middle East, from the 
end of the Palaeolithic around 10,000 years ago 
until the early Bronze Age around 7,000 years 
ago. This database will dovetail with another 
one that covers the entire world over the past 
5,000 years. That resource codifies information
about the culture, religion and ritual
practices of people worldwide, and combines
this with measures of social complexity — for 
example, how many levels of administration 
a society's government has, or the number of 
distinct professions — as well as data on the 
intensity of warfare. The plan is to use this 
database to explore the links between ritual 
and social life, as well as the roles of war and 
competition between societies in nurturing 
certain kinds of ritual and driving increases in 
social complexity. 


Rituals could feed 
conflict by turning 
opinions into 
‘sacred values’. 


Members of the ESRC project are also prob- 
ing people's beliefs about how rituals work. For 
example, Cristine Legare at the University of 
Texas at Austin has studied Brazilian rituals 
called simpatias, which are used to solve every- 
day problems ranging from bad luck to asthma 
and depression. A simpatia for getting a good
job says that during the full Moon the jobseeker 
must take the jobs page out of a newspaper, fold 
it four times, and then place it on the floor with 
a small white candle surrounded by honey and 
cinnamon, imagining themself in a new job 
with good pay. The candle stub and the paper 
should be buried with a plant and watered daily, 
and the dream job will soon emerge. 


The ritual mind 

Legare presented Brazilians with a variety of 
simpatias, and found that people judged them 
as more effective when they involved a large 
number of repetitive procedural steps that 
must be performed at a specific time and in 
the presence of religious icons. “We're built to 
learn from others,” she says, which leads us to
repeat actions that seemed to work for some- 
one else — “even if we don't understand how 
they produce the desired outcomes”. 

Meanwhile, psychologist Ryan McKay 
at Royal Holloway, University of London, 
and Jonathan Lanman, a cognitive anthro- 
pologist at Queen’s University, Belfast, are 
exploring how rituals can be broken down 
into their component parts and how each part 
influences behaviour. One such component 
is synchronized physical action — for exam- 
ple, the ritualized goose-stepping of military 
units — which social psychologists have 
shown promotes a sense of connection and
trust between individuals. 

This work builds on research by Richard 
Sosis, an anthropologist at the University of 
Connecticut, who has shown that immersion 
in collective rituals, such as communal prayer, 
in Israeli kibbutzim increases cooperative 
behaviour in economic games — but only
with other kibbutz members.

Ritual also has its darker side. Surveys by
Ara Norenzayan, a psychologist at the University
of British Columbia in Vancouver who has
an advisory role on the project, suggest that 
support for suicide terrorism among Palestin- 
ians is more strongly tied to communal ritual 
attendance than to religious devotion, as measured
by the frequency of private prayer.

Atran thinks that rituals could also feed con- 
flict by turning the opinions and preferences 
of groups into ‘sacred values’ — absolute and 
non-negotiable beliefs that cannot be traded 
against material benefits such as money. For 
many Israelis, for example, one such value is 
the right to occupy the West Bank, whereas 
for many Palestinians it is the right to return 
to the villages from which they were expelled. 
In fact, Atran has found that financial offers
to compromise on these sacred values make
them even more entrenched.

As an example of how rituals can cause 
values and preferences to become sacralized, 
Atran points to his studies showing that, in the 
United States, people who attend church more 
frequently are more likely to consider the right 
to bear arms a sacred value.

“Emotionally intense rituals have bound 
us together and pitted us against our enemies 
throughout the history of our species,” says 
Whitehouse. “It was only when nomadic for- 
agers began to settle down did we discover 
the possibilities for establishing much larger 
societies based on frequently repeated creeds 
and rituals.” 

The big question, he says, is whether this 
kind of unity can be extended to humanity 
at large. For Whitehouse, understanding the 
ways that rituals shape group behaviour is the 
first step towards finding out how they can be 
harnessed to dampen down conflict between 
groups. He hopes that such insights could 
help policy-makers to “establish new forms of 
peaceful cooperation, as well as bringing down 
dictators”.


Dan Jones is a freelance writer in Brighton, 
UK. 
A satellite image of snow on the Hindu Kush mountains in Asia, with regions of high absorption of sunlight by dust and black carbon shaded in red.


A vision for data science 


To get the best out of big data, funding agencies should develop shared tools for 
optimizing discovery and train a new breed of researchers, says Chris A. Mattmann. 


Two small words — ‘big data’ —
are getting a lot of play across the
sciences. Funding agencies, such as
the National Science Foundation and the
National Institutes of Health in the United 
States, have created million-dollar pro- 
grammes around the challenges of storing 
and handling vast data streams. Although 
these are important, I believe that agencies 
should focus on developing shared tools for 
optimizing discovery. 

Big data are big in three ways: the volume 
of information that systems must ingest, 
process and disseminate; the number and 
complexity of the types of information 
handled; and the rate at which information 
streams in or out. Terabyte-sized data sets
(10¹² bytes) are now common in Earth and
space sciences, physics and genomics (see 
‘Data deluge’). But a lack of investment in 
services such as algorithm integration and 
file-format translation is limiting the ability 
to manipulate archival data to reveal new 
science. 

At the Jet Propulsion Laboratory (JPL) 
in Pasadena, California, I am a principal 
investigator in a big-data initiative, pursu- 
ing projects on data archiving and mining, 
smart algorithms and low-power hardware 
for astronomy and Earth science. Rather than 
finding one system that can ‘do it all’ for any 
data set, my team aims to define a set of archi- 
tectural patterns and collaboration models 
that can be adapted to a range of projects. 


I believe that four advancements are 
necessary to achieve that aim. Methods for 
integrating diverse algorithms seamlessly 
into big-data architectures need to be found. 
Software development and archiving should 
be brought together under one roof. Data 
reading must become automated among 
formats. Ultimately, the interpretation of 
vast streams of scientific data will require a 
new breed of researcher equally familiar with 
science and advanced computing. 


ALGORITHM INTEGRATION 

A project by my team at the JPL illustrates 
the challenges of working with big data. In 
2011, we were asked by the US National 
Climate Assessment to establish a
computing facility to integrate a range of
snow-related measurements — and to do so
in a month. The data included observations
from the western United States, Alaska and 
the Hindu Kush-Himalayan regions, as well 
as the entire Earth-observing record since 
2000 and subsequent monitoring. The data 
products and maps would amount to several 
hundred terabytes. 

The algorithms to be incorporated were 
varied, and included codes for estimating 
snow coverage, grain size and absorp- 
tion of solar radiation by dust and black 
carbon. They had been written in IDL, a
specialized programming language used by 
many researchers. Geographers, remote- 
sensing experts and software programmers 
contributed. 

Most computer scientists would assume 
that such a system would take years, not 
weeks, to develop. The algorithms would 
presumably have to be rewritten in a stand- 
ard language such as C++, Java or Python, 
or one that could run on a fast computer 
system or infrastructure, such as Google's 
MapReduce model. 

But, in my experience, there is no need 
to rewrite scientific algorithms for big- 
data systems. Rewriting only increases the 
barriers to communication between scien- 
tists and computer engineers. Rewriting can 
also introduce costly errors. 

Computer engineers should trust 
scientists to produce executable algo- 
rithms, which can be plugged into a larger 
processing framework. The skill is in tying 
the input and output files and relevant 
parameters unobtrusively into the big- 
data network, so that the algorithm can 
run seamlessly within it. With a modu- 
lar approach, development can proceed 
quickly in parallel — we constructed our 
snow-science computing facility this way 
in less than a month. 
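
The plug-in approach described above can be sketched in a few lines of Python. This is only an illustration, not the JPL facility's actual code: the `run_algorithm` wrapper, the stand-in script and its `--scale` parameter are all hypothetical. The wrapper treats the scientist's program as an opaque executable and wires up only its input file, output file and parameters.

```python
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical stand-in for a scientist's existing program. In practice this
# would be an unmodified executable (for example an IDL routine); here it is a
# tiny script that multiplies each input value by the --scale parameter.
STAND_IN_ALGORITHM = """
import sys
src, dst, _flag, scale = sys.argv[1:5]
with open(src) as f:
    values = [float(line) for line in f]
with open(dst, "w") as out:
    for v in values:
        out.write(f"{v * float(scale)}\\n")
"""


def run_algorithm(command, input_path, output_path, params=None):
    """Run an executable algorithm as one step of a larger pipeline.

    The wrapper never looks inside the algorithm; it only passes the
    input file, the output file and any named parameters.
    """
    args = list(command) + [str(input_path), str(output_path)]
    for name, value in (params or {}).items():
        args += [f"--{name}", str(value)]
    completed = subprocess.run(args, capture_output=True, text=True)
    if completed.returncode != 0:
        raise RuntimeError(f"algorithm failed: {completed.stderr.strip()}")
    return Path(output_path)


def demo():
    """Wire the stand-in algorithm to temporary input and output files."""
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        script = tmp / "stand_in_algorithm.py"
        script.write_text(STAND_IN_ALGORITHM)
        src = tmp / "input.txt"
        src.write_text("1.0\n2.0\n")
        out = run_algorithm(
            [sys.executable, str(script)], src, tmp / "output.txt", {"scale": 2}
        )
        return out.read_text().split()


if __name__ == "__main__":
    print(demo())
```

Because the wrapper depends only on files and parameters, the algorithm itself can stay in whatever language it was written in, and several such steps can be developed in parallel.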


DEVELOPMENT AND STEWARDSHIP 

Today, different big-data computing tasks 
are usually undertaken by different teams. 
The bulk of agency funding goes to building 
specific long-standing archives or data grids
— systems such as the NASA Earth science
Distributed Active Archive Centers or the
International Virtual Observatory Alliance in
astronomy — that disseminate, preserve and
steward data. Large archives have received
an average of US$100 million a year from US 
federal agencies over the past decade. 

By contrast, the development, integration 
and updating of science algorithms receives 
only between $1 million and $5 million per 
year in the United States. These tasks are 
carried out in science-computing facilities, 
which are often small and transient. Because 
they must do more for less, such facilities 
largely use and generate community-based 
open-source software. Examples include
Apache Hadoop and Apache Tika, used
in Earth science, biomedicine and business.

DATA DELUGE
The billions of terabytes (TB) produced in one year
by the SKA telescope will dwarf today's data
sets in genomics and climate science.
Encyclopedia of DNA Elements (ENCODE), 2012: 15 TB
US National Climate Assessment (NASA projects), 2013: 1,000 TB
Fifth assessment report by the Intergovernmental Panel on Climate Change (IPCC), due 2014: 2,500 TB
Square Kilometre Array (SKA), first light due 2020: 22,000,000,000 TB per year

Although data interpretation and archiv- 
ing efforts have so far been funded separately 
and at strikingly dissimilar levels, their needs 
— such as workflow processing and file and 
resource management — are complemen- 
tary and overlapping. As storage and com- 
putation costs fall, algorithm developers are 
moving into preservation, both to archive 
their own work and to open new research 
windows on large data sets that were previ- 
ously closed. 

In the next decade, I believe that archives 
and science-computing facilities must merge. 
The international radio-astronomy commu- 
nity is doing so in preparation for the Square 
Kilometre Array radio telescope, due to see 
first light in 2020. The enormous volume of 
data that the array will produce — 700 tera- 
bytes each second — will, after just a few 
days, eclipse the current size of the Internet. 
Archives in the United States such as those at 
the National Radio Astronomy Observatory’s 
Expanded Very Large Array and the Atacama 
Large Millimeter/submillimeter Array are 
developing software to handle that deluge. 
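
As a rough check on these figures, the per-second rate can be converted into a yearly volume. The sketch below assumes that the array records continuously at 700 TB per second over a 365.25-day year (an assumption made here for illustration, not an operating plan); the result, roughly 2.2 × 10^10 TB, is in line with the 22,000,000,000 TB per year quoted in ‘Data deluge’.

```python
# Convert the quoted SKA data rate into an approximate yearly volume.
# Assumption: continuous recording at a constant rate for a full year.
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60  # 31,557,600 seconds
RATE_TB_PER_SECOND = 700                  # rate quoted in the text


def yearly_volume_tb(rate_tb_per_second=RATE_TB_PER_SECOND):
    """Return the volume in terabytes produced in one year at a constant rate."""
    return rate_tb_per_second * SECONDS_PER_YEAR


if __name__ == "__main__":
    print(f"{yearly_volume_tb():.2e} TB per year")
```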


MANY FORMATS 

Big-data systems must deal with thousands 
of file types and conventions. The commu- 
nities that have formed around informa- 
tion modelling, ontology and semantic web 
software address this complexity of data and 
metadata (descriptive terms attached to files) 
to some extent. But they have so far relied on 
human intervention. None has delivered the 
silver bullet: automatic solutions that iden- 
tify file types and extract meaningful data 
from them. 


474 | NATURE | VOL 493 | 24 JANUARY 2013 


© 2013 Macmillan Publishers Limited. All rights reserved 


Comparisons of observational and model 
data are, for example, under construction for 
the US National Climate Assessment and the 
Coupled Model Intercomparison Project of 
the Intergovernmental Panel on Climate 
Change. NASA uses the Hierarchical Data 
Format version 5 (HDF-5) and the HDF- 
Earth Observing System metadata repre- 
sentation. The outputs of climate models 
are stored in the Network Common Data 
Form, typically with climate and forecast 
metadata conventions. Automatic methods
will be needed to match and analyse these
data, which amount to petabytes (10¹⁵ bytes).

Some big-data fields are switching to 
formats like these that have better sup- 
port. Astronomers, for instance, are turn- 
ing to NASA’s HDF-5 file format from the 
Flexible Image Transport System that has 
been their standard. But history shows 
that defining a single, unifying file for- 
mat is not the answer, because prolifera- 
tion of file types will continue. Instead, we 
need a toolkit of automatic ways to boil 
file formats down to their essence, and 
more formats that are amenable to those 
approaches. We need flexible systems that 
can perform multiple functions and deal 
with diverse data. Encouraging efforts are 
under way, including with Apache OODT
and Apache Tika.
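
One simple ingredient of such a toolkit is to identify a file from its leading 'magic' bytes rather than from its name. The sketch below is a hypothetical illustration, not the Apache Tika or OODT API; the function names are invented here, and it checks only the published signatures of three formats mentioned in this section (HDF5, classic NetCDF and FITS).

```python
# Identify a scientific file format from its first bytes.
# Signatures are taken from the published format descriptions:
HDF5_MAGIC = b"\x89HDF\r\n\x1a\n"                 # 8-byte HDF5 signature
NETCDF_CLASSIC_MAGIC = (b"CDF\x01", b"CDF\x02")   # classic and 64-bit offset
FITS_MAGIC = b"SIMPLE  ="                         # first header card of a FITS file


def sniff_format(header: bytes) -> str:
    """Guess the format from the leading bytes of a file."""
    if header.startswith(HDF5_MAGIC):
        # NetCDF-4 files are stored as HDF5, so they are reported here too.
        # An HDF5 file with a user block carries its signature at a later
        # offset; this sketch only checks offset 0.
        return "HDF5 (also used by NetCDF-4)"
    if header.startswith(NETCDF_CLASSIC_MAGIC):
        return "NetCDF classic"
    if header.startswith(FITS_MAGIC):
        return "FITS"
    return "unknown"


def sniff_file(path) -> str:
    """Read the first few bytes of a file and guess its format."""
    with open(path, "rb") as f:
        return sniff_format(f.read(16))
```

Detection of this kind is only the first step; extracting comparable metadata from each format would still need format-specific readers.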


PEOPLE POWER 

To solve big-data challenges, researchers 
need skills in both science and computing 
— a combination that is still all too rare. A 
new breed of ‘data scientist’ is necessary. 

As well as being data stewards, data 
scientists will develop bespoke algorithms 
for analysis and adapt file formats. They 
will understand the mathematics, stat- 
istics and physics necessary to integrate 
science algorithms into efficient architec- 
tures. They will find solutions beyond the 
fragmented community efforts that have 
dominated the past decade of development 
of big-data systems. 

Funding agencies should support comput- 
ing facilities that combine big-data steward- 
ship and software development, employing 
data scientists to bridge the gap. Coordina- 
tion between agencies is crucial to avoid 
duplication. The Big Data Senior Steering 
Group, linking efforts across the National 
Science Foundation, the National Institutes 
of Health, NASA and others, is a promising 
early example. More oversight will be needed 
to establish new working patterns. 

Because big-data fields stretch across 
national as well as disciplinary boundaries, 
such facilities and panels must be interna- 
tional. In centres of excellence around the 
world, such as the JPL, data scientists will 
help astronomers and Earth scientists to 
share their approaches with bioinformati- 
cians, and vice versa. 


For the specialism to emerge and 
grow, data scientists will have to over- 
come barriers that are common to 
multidisciplinary research. As well as 
acquiring understanding of a range of 
science subjects, they must gain aca- 
demic recognition. Journals such as the 
Data Science Journal should become 
more prominent within the comput- 
ing community. Software products and 
technologies should be valued more by 
academic committees. 

New interdisciplinary courses will 
be needed. The University of Califor- 
nia, Berkeley, and Stanford University 
in California have set up introductory 
courses for computer scientists on big- 
data techniques — more universities 
should follow suit. Natural scientists, 
too, should become familiar with com- 
puting and format issues. 

In my lectures for computer-science 
graduates, I have brought together stu- 
dents at the University of Southern Cali- 
fornia in Los Angeles with researchers at 
the JPL. Using real projects, my students 
see the challenges awaiting them in their 
future careers. I hope to employ some of 
them on the projects that will flow from 
the JPL’s big-data initiative. The technologies
and approaches that they develop will
spread beyond NASA through contribu- 
tions to the open-source community. 

Empowering students with knowledge 
of big-data infrastructures and open- 
source systems now will allow them to 
make steps towards addressing the major 
challenges that big data pose.


Chris A. Mattmann is a senior 
computer scientist at the Jet Propulsion 
Laboratory, California Institute of 
Technology, Pasadena, California 91109, 
USA, and adjunct assistant professor 

in computer science at the University 

of Southern California, Los Angeles, 
California 90089, USA. 

e-mail: chris.a.mattmann@nasa.gov 
Fuel-efficient cars cost less to run, so people might use them a little more.


The rebound effect 
is overplayed 


Increasing energy efficiency brings emissions savings. 
Claims that it backfires are a distraction, say Kenneth 
Gillingham and colleagues. 


Buy a more fuel-efficient car and you
will spend more time behind the
wheel. That argument, termed the 
rebound effect, has earned critics of energy- 
efficiency programmes a voice in the 
climate-policy debate, for example with an 
article in The New York Times entitled “When
energy efficiency sullies the environment”.
The rebound effect idea — and its extreme
variant the ‘backfire’ effect, in which
supposed energy savings turn into greater
energy use — stems from nineteenth-century
economist Stanley Jevons. In his 1865 book
The Coal Question, Jevons hypothesized
that energy use rises as industry becomes
more efficient because people produce and
consume more goods as a result.

The rebound effect is real and should be 
considered in strategic energy planning. 
But it has become a distraction. A vast > 
Villagers in Blanche Bay, New Guinea, in the 1890s.


Power of the past 


Monique Borgerhoff Mulder assesses an exploration of 
how modern industrial and traditional societies differ. 


When outsiders come into a discipline
and publish a popular book
that becomes a classic, insiders
may not be pleased. Typically, there is territoriality
(those ideas aren't yours); pedantry
(so much inaccuracy); and jealousy (I should 
have done it myself). 

Jared Diamond — once ecologist, now 
ornithologist and geographer by trade — has 
ventured into anthropology twice before. In 
Guns, Germs and Steel (Norton, 1997) he 
offered distinct and powerful hypotheses 
for why different parts of the globe devel- 
oped at such different rates, attracting a wide 
readership to big questions in anthropology 
that few had dared to tackle. In Collapse 
(Viking, 2005), he provided a loosely syn- 
thesized catalogue of the factors associated 
with the downfall of civilizations. Now we 
have The World Until Yesterday, offering the 
personal reflections of a Westerner who has 
visited communities still peripheral to the 
globalized world — thoughts that anthro- 
pologists rarely divulge. 


Diamond uses his 50-odd years’ worth of 
visits to remote New Guinea in the pursuit 
of birding fieldwork to meditate, as the pro- 
logue details, on “how all of our ancestors 
lived for tens of thousands of years, until vir- 
tually yesterday”. He aims to identify what 
we can learn from the foraging and simple 
farming societies that preceded modern 
(and indeed ancient) states. In chapters on 
warfare, child rearing, the care of the elderly 
and health, we are treated to engaging sto- 
ries from New Guinea and samplings from 
ethnographic texts on other traditional pop- 
ulations. The result — which reads in some 
places like a cultural-anthropology textbook 
from the 1970s, in others like a mesmeriz- 
ing thriller — reveals a world in which some 
people strangle their elders rather than book 
them into retirement 
homes, and resolve 
conflicts with stran- 
gers over a ceremo- 
nial feast rather than 
an affidavit. 


NATURE.COM 
For a review of
Jared Diamond’s 
Collapse, see: 
go.nature.com/8ug879 


BOOKS & ARTS | COMMENT | 


Diamond’s central objective is to determine
whether ‘we’ or ‘they’ do it better, be it in
conflict resolution, child rearing or how we eat.
He looks for what we can learn from them:
co-sleeping with infants, multilingualism,
restorative justice and enriched lives for the
elderly, with no retirement age, are some
highlights.

JARED DIAMOND
Viking: 2012. 512 pp. $36

Many anthropolo- 
gists will undoubtedly object to this us-ver- 
sus-them framing, although for a popular 
readership it is clearly thought-provoking. 
Human diversity in social organization can- 
not be dichotomized as traditional versus 
modern. Diamond recognizes this in his 
preface, but thereafter chooses to ignore it. 
Instead, he compares foraging and hoe-cul- 
tivating societies in Africa, South America 
and New Guinea with communities typical 
of modern industrial states, primarily the 
United States. However, he does not address 
the intriguing question of how power, status 
and resources are distributed among indi- 
viduals across the diverse kinds of socie- 
ties that have existed in human history, or 
the implications of these distributions for 
human welfare. 

Later chapters on danger, language, reli- 
gion and health revert more to the style of 
argument Diamond used in Guns, Germs 
and Steel, with appeals to evolutionary and 
ecological explanations. 

Apropos of danger, Diamond tells a ripping 
yarn of his canoe-wreck at dusk in the Indo- 
nesian archipelago. This ordeal suggested to 
him the concept of ‘constructive paranoia’,
an alertness to risk that he believes might 
be adaptive for traditional peoples living in 
dangerous environments. With regard to 
language, he follows others in noting that lin- 
guistic diversity maps onto high net primary 
productivity, low seasonality and low human- 
population mobility — in other words, lan- 
guages diversify where there is plentiful food 
available all year and human settlements are 
not forced to move. And, in the theme of ear- 
lier chapters, he promotes language diversity 
as something to strive for, citing the cognitive 
benefits associated with bilingualism. 

Evolution by natural selection appears 
only late in the book, in a well-balanced 
discussion of the multiple functions of reli- 
gion and how these change in importance 
over time. Supernatural explanations for 
unforeseen events have declined as scien- 
tific thought gained ascendancy, whereas 
the provision of comfort, hope and meaning 
has become more prominent over time
as inequalities among citizens have
escalated. Again, the material draws on
the work of other scholars, but Diamond
suggests, interestingly, that US citizens
may be more religious than those of
other industrialized nations because of
the high levels of economic inequality in
their country (religion helps to rationalize
prosperity for those at the top of the
ladder and mitigate adversity for those at
the bottom).

The World Until Yesterday: What Can We Learn from Traditional Societies?

Finally, in a chapter on health, Dia- 
mond recapitulates arguments for the 
‘diseases of civilization’ — that the fats 
and salts we consume do not suit physi- 
ologies designed for foraging, and cause 
the hypertension and diabetes crises that 
we see in populations transitioning to 
modern lifestyles. With reference to James 
Neel’s ‘thrifty gene’ hypothesis, Diamond 
attributes the particularly high incidence 
of type 2 diabetes among Pima Native 
Americans and Nauruan Pacific Island- 
ers to catastrophic recent famines in these 
populations, which may have favoured 
the survival of individuals who most effi- 
ciently converted sugars to fats. He also 
proposes that Europeans may have been 
through a gradual and largely undetect- 
able diabetes crisis, starting in the fif- 
teenth century, as food security increased 
with the development of state distribution 
systems — with Johann Sebastian Bach 
named as one possible victim. 

Diamond has an engaging style, and 
has packed the book with grisly anec- 
dotes and delightful detail from the jun- 
gles of New Guinea to the strip malls of 
Los Angeles, California, alerting us to 
grim realities of the industrial world. 
For example, a single pan-fried-noodle 
combo dish served in Los Angeles 
contains more than a year’s worth of a 
Yanomami Indian’s salt intake. The book
brings anthropology alive for those who 
have never had the privilege of visit- 
ing, or reading deeply about, societies 
very different from their own. For those 
who have worked in the remote reaches 
of the world, it may tire or even annoy. 
Tire, because the book is long, and draws 
on overly familiar ethnographies; annoy, 
because the scale of comparison, us ver- 
sus them, is conceptually limiting. 

Diamond has previously described his 
writings on the cultures of New Guinea 
as journalism. Perhaps this is how we 
should read The World Until Yesterday, 
as a highly personal reflection on the 
virtues and vices of modern industrial 
civilization.


Monique Borgerhoff Mulder is a 
human behavioural ecologist at the 
University of California, Davis, USA. 
e-mail: mborgerhoffmulder@ucdavis.edu 
One of the vessels captained by US explorer Richard Byrd in Antarctica in the 1930s.


Frozen assets 


Edmund Stump welcomes a history of Antarctica that 
covers the glory, the rivalries and the scientific legacy. 


From James Cook’s circumnavigation of
Terra Australis Incognita in 1772-75
to today’s shifting international population
of researchers, Antarctica’s history is
stamped by the continent's remote isolation,
extreme climate and scientific importance.
Its strategic and economic potential have
not escaped government attention. Behind
the scenes, nations have long plotted to win
sovereignty and control resources. And that
is where David Day’s Antarctica: A Biography
takes us, into a two-faced world of public
and covert intentions where personal and
national rivalries abound.

Antarctica is the first comprehensive history
of the continent, spanning the centuries
since Cook’s voyage. The heroic expeditions
of Robert Falcon Scott, Roald Amundsen
and other luminaries are all there, as are
many that are less well known. Day couches
these throughout in political contexts —
how they served the motives of the colonial
offices or state departments back home. The
dramas, played out in secret memos and in
published statements in newspapers, give the
book a slow, even glacial, pace at times.

At every turn there is hesitancy on the part
of diplomats and leaders: to claim or not to
claim, to recognize others’ claims or not, to
offend trading partners or not, to respond to
official memos or to let them go unanswered.
In this way we have collectively backed into
today’s Antarctica, where all territorial claims
are held in abeyance, a 50-year moratorium
prohibits exploration for mineral and energy
resources, and tourists flock in droves.

Antarctica: A Biography
DAVID DAY
Oxford Univ. Press: 2013. 624 pp. £25

Day has done a remarkable job of
collating information from rich and varied
international sources. He draws from original
accounts, newspaper articles, the recently
released papers of US naval officer and polar
explorer Richard Byrd, and numerous national archives
with their copious committee reports and
memos, some recently declassified.

The concealment of who made what 
claims where and when, and how different 
versions were presented to the public, domi- 
nate the book. One example is what Byrd told 
reporters in Dunedin, New Zealand, after 
his first Antarctic expedition (1928-30). 
Although he said he was “not the least concerned
with claiming the land for America”,
he had recently photographed miles of 
mountains and coastline in Marie Byrd Land 
with just that intent. He had also instructed 
Laurence Gould, his second in command, to 
leave a note in a cairn claiming the territory 
for the United States when Gould’s ground 
party traversed beyond the boundary of New 
Zealand's Ross Dependency. 

Although I am familiar with the history 
of exploration in the Peninsula and Ross 
Sea sectors of Antarctica, Day expanded my 
horizons to the ‘African’ sector, the section 
of East Antarctica facing the tip of Africa. 
Here, between the world wars, Norwegians 
led the last great slaughter of Antarctic whales 
and mapped major sectors of the coastline, 
infringing on claims by the ever-vigilant 
Australians. During the 1929-30 summer
season, for instance, the Norwegians and 
Australians converged on Enderby Land 
intent on extending their claims to the same 
territory and forestalling the other's. 

I also enjoyed the account of the jostling
at Deception Island during and immediately 
after the Second World War, when Argen- 
tina painted its colours on the abandoned 
British station. After the British removed 
them, subsequent claims were made by Chile 
and Argentina, ultimately leading to ‘Opera- 
tion Tabarin’ — a secret UK mission charged 
with building and maintaining permanent 
British stations on the Peninsula. Follow- 
ing the war, scientific activity increased and 
responsibility was transferred to the Falkland
Islands Dependencies Survey, which in
1962 became the British Antarctic Survey. 

Each chapter covers a specific period in 
dense detail. There is grist here for the rumi- 
nations of historians, although a more casual 
reader might find it hard to digest at times. 
But the final chapter, covering the period 
since the 1961 signing of the Antarctic Treaty, 
breezes through the turning points. 

I was reminded of how, in June 1988, the 
Convention on the Regulation of Antarctic 
Mineral Resource Activities had been adopted 
and was poised for ratification by the treaty 
nations. Then, in January 1989, the Argen- 
tinian ship Bahia Paraiso ran aground off the 
Antarctic Peninsula, spilling oil into the sur- 
rounding waters. Two months later, the Exxon 
Valdez went down in Prince William Sound, 
Alaska, emptying 260,000 barrels of crude oil 
into similarly frigid waters. Countries that had 
quietly endorsed the convention scrambled 
to withdraw their support, and within two 
years the Protocol on Environmental Pro- 
tection to the Antarctic Treaty (the Madrid 
Protocol) was in place, prohibiting the exploration
and exploitation of mineral resources
on the continent for 50 years. 

But marine resources remain a contentious 
issue, as witnessed by the failure in Hobart 
last year of the Commission for the Conser- 
vation of Antarctic Marine Living Resources 
to establish proposed marine reserves in 
critical areas in the Southern Ocean. 

Day ends by saying, “For centuries, the 
Antarctic defied man’s approach. Now its 
dangers and its terrors have been largely con- 
quered. Only its future remains unknown.” I 
would add that, thanks to Day, the intrigues 
and posturing that saturate the history of this 
distant land have now been exposed. We are 
left to wonder what continues to be plotted 
behind the scenes.


Edmund Stump is a professor of exploration 
at Arizona State University who has led many 
Antarctic field projects. He is the author
of The Roof at the Bottom of the World:
Discovering the Transantarctic Mountains.
e-mail: ed.stump@asu.edu 


Books in brief 


The Physics of Wall Street: A Brief History of Predicting the 
Unpredictable 

James Owen Weatherall HOUGHTON MIFFLIN HARCOURT 304 pp. $27 (2013)

Wall Street’s 2008 meltdown is often pinned on ‘quants’, the 
physicists and mathematicians who invented financial instruments 
such as derivatives. Physicist James Owen Weatherall argues that it 
was less about the models than a catastrophic misuse of them. He 
bolsters his view with a concise history of scientific bravehearts out 
to tame the market, from Louis Bachelier, pioneer of the maths of 
financial markets, to recent risk-wranglers such as Didier Sornette. 


What Has Nature Ever Done For Us? How Money Really Does Grow 
On Trees 

Tony Juniper PROFILE BOOKS 256 pp. £9.99 (2013)

Microbes restore soils, trees oxygenate the atmosphere, vultures
act as sanitation crews. And, says Tony Juniper in this crisp call
to action, such ‘ecosystem services’ could back the currency of a
new economics. The concept is hardly new, but Juniper — former 
head of Friends of the Earth — declares its time has come. As he 
trawls the biosphere and its rich array of services, his insights on 
environmental threats and solutions, backed by robust findings, 
form a pragmatic argument for a return to ‘gardening the Earth’. 


What’s Wrong with Fat? 

Abigail C. Saguy OXFORD UNIVERSITY PRESS 272 pp. $29.95 (2013) 
Obesity is an international epidemic. Or is it? Sociologist Abigail C. 
Saguy weighs in with an investigation of fatness that looks 
provocative, yet is thoughtful and thorough. By examining heft 
through many lenses, from the ethical to the scientific, Saguy traces 
how labelling fat as a disease or as an indicator of immorality seeps 
into society. She argues that public-health diktats are issued despite 
scientific debate over the condition; and that factors in fatness, such 
as poverty, are often ignored. Ultimately, she avers, stigmatizing the 
condition serves to embed it further. 


Visions of a Vanished World: The Extraordinary Fossils of the
Hunsrück Slate

Gabriele Kühl, Christoph Bartels, Derek E. G. Briggs and Jes Rust
YALE UNIVERSITY PRESS 128 pp. $40 (2012)
Some 400 million years ago, catastrophic storms and seismic
shifts may have triggered the release of a vast slick of sediment that
buried thousands of soft-bodied marine organisms on the ocean
floor. Germany’s Hunsrück Slate is the result: a frozen slice of the
Devonian, capturing trilobites, ammonites and more in their last
second of life. Pyritization has left exquisitely detailed, golden-hued
fossils, hauntingly displayed in this coffee-table volume.


Saltpeter: The Mother of Gunpowder 

David Cressy OXFORD UNIVERSITY PRESS 356 pp. £16.99 (2013) 
Gunpowder obsessed early-modern Europe and Islamic Asia. But its 
key ingredient — potassium nitrate, or saltpetre — had either to be 
extracted from excrement- and urine-drenched soils, or imported. 
The scramble to get enough is the focus for this science-tinged 
history of a Britain building up to empire and enlightenment. David 
Cressy tells the tale of this “crucial link in the chain of chemistry and 
power” with panache, from the unravelling of saltpetre’s chemistry 
to the warmongering that fostered dependence on it. 
Science petitions are
a facade of numbers 


Scientists and science 
organizations increasingly use 
petitions to influence public 
opinion and government 
decisions, and the practice is now 
spilling over into peer-reviewed 
journals. We question the 
effectiveness of public petitions, 
and suggest that ganging up 
against unpopular research risks 
damaging scientific discourse. 
Petitioners recruit 
researchers as co-signatories 
to reinforce their argument, 
often for people who may not 
fully understand the debate. 
Last year, for example, 31,000 
scientists signed a petition 
rejecting the idea of global 
warming (www.petitionproject. 
org) — but a counter-petition 
with 31,001 signatures won't 
resolve the debate. Numbers 
and scientific argument 
generally have little influence 
on public opinion or political 
decisions (D. Ding et al. Nature 
Clim. Change 1, 462-466; 2011), 
partly because of the difficulty 
in assessing the authority of 
individual co-signatories. 
‘Gang science’ is being used
to quash unpopular ideas in 
peer-reviewed journals. For 
example, 141 scientists mounted 
an attack on proposals for 
managing introduced species 
(D. Simberloff et al. Nature 
475, 36; 2011), and 137 others 
challenged a paper written by 
three authors on the evolution 
of eusociality (P. Abbot et al. 
Nature 471, E1-E4; 2011). In 
the absence of new data, such 
huge conglomerates contribute 
little more than intimidation. 
We should judge the validity 
of scientific ideas on hard, 
replicable data and not on the 
number of authors. Otherwise, 
scientists risk being branded as 
another advocacy group with its 
own agenda. 
Robert J. Warren Buffalo State, 
State University of New York, 
Buffalo, New York, USA. 
warrenrj@buffalostate.edu 
Mark A. Bradford Yale University, New Haven,
Connecticut, USA.


Small collections 
make a big impact 


In an era in which support for 
natural-history collections is 
waning, we wish to point out 
how effective even a small, young 
collection can be. 

We constructed a Google 
Scholar profile (called UAM 
Birds) of publications that used 
the bird collection we oversee 
at the University of Alaska 
Museum in Fairbanks. The 
collection is supported by 1.3 
full-time-equivalent staff, and 
it served in whole or in part 
as research infrastructure for 
these publications, contributing 
and preserving specimens and 
associated information. 

The body of work supported 
by the collection is diverse 
and well cited, with a profile 
h-index of 42, equivalent to that of an
average Nobel laureate in physics
(J. E. Hirsch Proc. Natl
Acad. Sci. USA 102, 16569-
16572; 2005). This positively 
sings ‘good investment’ and 
should encourage other 
institutions to rediscover 
and reinvest in collections as 
important societal resources. 
Kevin Winker, Jack J. Withrow 
University of Alaska Museum, 
Fairbanks, Alaska, USA. 
kevin.winker@alaska.edu
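
For readers unfamiliar with the metric cited in the letter above, the following is a minimal sketch of how an h-index is computed under Hirsch's definition: the largest number h such that h publications each have at least h citations. The function name and the citation counts are hypothetical and purely illustrative; they are not the UAM Birds profile data.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break  # counts only fall from here, so h cannot grow further
    return h


# Hypothetical citation counts for five papers (illustration only).
example_counts = [10, 8, 5, 4, 3]
print(h_index(example_counts))
```

A profile h-index of 42 would therefore mean that 42 publications in the profile have each been cited at least 42 times.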


Obesity: multiple 
factors contribute 


The idea that obesity is mainly 
caused by a high intake of 
carbohydrate is only one of many 
possible explanations (G. Taubes 
Nature 492, 155; 2012). The 
energy in-energy out hypothesis 
may also be valid, but it is likely to 
be an oversimplification. 

It is important to consider 
the body’s requirements for key 
nutrients such as the essential 
amino acids, as well as for energy. 
A high-energy diet deficient in 
these can lead to malnutrition 
coupled to obesity.

Scientists need to take a closer 
look at the effects of dietary 
protein, lipid and carbohydrate 
on appetite and food intake. 
These should be assessed in the 
context of hormone regulation of 
intermediary metabolism and the 
large variation in basal metabolic 
rate among individuals. 

We shall then have a more 
accurate basis for advising 
Western populations on how to 
avoid obesity. 

Kristin Hamre National 
Institute of Nutrition and Seafood 
Research, Bergen, Norway. 
kha@nifes.no 


Obesity: appetite 
hormone weighs in 


Gary Taubes suggests that obesity 
researchers are currently focusing 
on energy balance at the expense 
of hormonal factors (Nature 492, 
155; 2012), but hormonal factors 
have always been an important 
part of obesity research. 

The German physician 
Bernard Mohr first described 
human obesity associated with 
abnormalities of the basal 
hypothalamus more than 170 
years ago (Wochenschr. Ges.
Heilkunde 6, 565-571; 1840). In 
the second half of the twentieth 
century, a negative feedback 
loop gradually became evident 
between the hormone leptin 
(produced by fat cells) and the 
hypothalamus, which has turned 
out to be a crucial regulator of 
levels of body fat. 

Since leptin’s discovery in 
1994, nearly 10,000 papers have 
appeared in PubMed on the 
involvement of leptin signalling 
in obesity. 

Stephan J. Guyenet University of 
Washington, Seattle, USA. 
guyenet@uw.edu 


Undo NIH policy to
ease effect of cuts


You say that a petition by US
biomedical scientists against
the one-retry limit on grant
applications to the National
Institutes of Health (NIH)
was “beside the point”, despite
acknowledging the validity of 
their arguments (Nature 492, 7; 
2012). As the petition’s author, I 
disagree: trying to correct a poor 
policy decision is not the entire 
point, as we all recognize, but 
it is most certainly part of the
point. 

We believe that the petition 
— which has now been signed 
by more than 3,000 scientists 
(see go.nature.com/x5tik5) — 
outweighs the questionable 
arguments for retention of the 
policy put forth by the NIH 
(see go.nature.com/mwfgel). 

In our view, NIH study 
sections cannot distinguish the 
quality of proposals in the first 
quartile. Therefore, once the 
percentage of applications being 
funded falls significantly below 
the 25th percentile, those in the 
first quartile that go unfunded 
should be allowed to resubmit as 
many times as they wish because 
they are equal in merit to those 
that are funded. 

We are not in the grip of 
an “unhealthy” obsession, as 
you suggest. The NIH’s unfair 
decision could be undone with 
the stroke of a pen, which would 
help to ease these tough times 
for biomedical research. We urge 
scientists to continue working on 
all fronts to contain the damage 
caused by reduced federal 
spending on research. 

Robert Benezra Memorial 
Sloan-Kettering Cancer Center, 
New York, USA. 
benezrar@mskcc.org 


Chance thrown by 
inaccurate dice 


The beauty of six-sided dice is 
that the spots on opposite sides 
add up to seven — a six and a
one, five and two, and four and 
three. Your illustration wrongly 
depicts these pairs on adjacent 
sides (Nature 492, 34-36; 2012). 
Janet K. Burg Eugene, Oregon, 
USA. 

bmeerkat@comcast.net 
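
The rule described in the letter above can be stated as a small check. This is an illustrative sketch only: the dictionary representation of a die (each face mapped to the face opposite it) and the function name are assumptions made for the example, not anything taken from the original illustration.

```python
# Each face mapped to the face on the opposite side of a standard die.
STANDARD_OPPOSITES = {1: 6, 2: 5, 3: 4, 4: 3, 5: 2, 6: 1}


def is_standard_die(opposite):
    """Check that every pair of opposite faces adds up to seven."""
    # All six faces must be present exactly once.
    if sorted(opposite) != [1, 2, 3, 4, 5, 6]:
        return False
    # Opposite faces must sum to seven, and the pairing must be mutual.
    return all(
        face + other == 7 and opposite.get(other) == face
        for face, other in opposite.items()
    )


# A die that puts the seven-sum pairs on adjacent sides has other pairs
# opposite each other, so it fails the check.
misdrawn = {1: 2, 2: 1, 3: 4, 4: 3, 5: 6, 6: 5}
print(is_standard_die(STANDARD_OPPOSITES), is_standard_die(misdrawn))
```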
Strength in numbers


A process called long-term potentiation mediates information storage — learning and memory — at the level of neurons.
An in vitro study turns the molecular understanding of this process on its head. But researchers’ opinions differ as to what 
can be inferred from these data. SEE ARTICLE P.495 


THE PAPER IN BRIEF

• Information storage involves stimulation-induced
strengthening of synaptic
communication between neurons over a
long time period.

• For some forms of memory, this
phenomenon, called long-term potentiation
(LTP), occurs in the brain’s hippocampus
region and involves AMPA receptors, which
are activated by the neurotransmitter
glutamate.

• It is widely believed that the cytoplasmic
tail of GluA1, which is one of four isoforms
that can constitute the four-subunit AMPA
receptor, is essential for trafficking of
the receptor to synapses to mediate
LTP (Fig. 1).

• Granger et al.¹ carried out a single-cell-based
study to investigate the minimum
amino-acid sequence of this tail that is
required for LTP.

• They found that the GluA1 cytoplasmic tail
was not required for LTP at all.

• In fact, the expression of a different
glutamate receptor, the kainate receptor, was
sufficient for LTP to occur.

• The only prerequisite for this process
seemed to be a large reserve pool of
glutamate receptors that could move to
synapses to mediate LTP.


One step forward
MORGAN SHENG


The surprising findings of Granger et al.¹
invoke a possible shift in our view of how
AMPA receptors are recruited to synapses.
The prevailing mechanistic view of LTP has
focused on the GluA1 and GluA2 subunits
of AMPA receptors, whose cytoplasmic tails
(C-tails) dangle inside the cell. A variety of proteins
bind differentially to the C-tails of GluA1
and GluA2, and disruption of these protein
interactions affects AMPA-receptor distribution
and trafficking in neurons². An appealing,
albeit unproven, hypothesis is that protein
interactions with the C-tails of GluA subunits
— interactions that are somehow regulated
by synaptic activity — would usher AMPA
receptors into synapses to induce LTP.
Indeed, compelling studies from Malinow
and co-workers concluded that specifically
GluA1 and its C-tail are essential for the delivery
of AMPA receptors to synapses during
LTP (for a review, see ref. 3). However, these
experiments relied largely on overexpression of
GluA1 and/or GluA2 in hippocampal neurons
in which normal endogenous AMPA receptors
were still present and so possibly interfered
with trafficking of the exogenous subunits.
In Granger and colleagues’ experiments,
the genes encoding GluA1, GluA2 and GluA3
were deleted, and so endogenous AMPA 
receptors were absent. It was in this setting 
that the authors found that not only were the 
known protein interactions of the GluA1 C-tail 
unnecessary for LTP, but also GluA2 (previously 
thought not to support LTP’) and even kainate 
receptors were sufficient to mediate LTP. 

This ‘molecular replacement of GluA sub- 
units in neurons that are devoid of AMPA 
receptors can be rejected as being just as arti- 
ficial as the experiments overexpressing them 
in the presence of endogenous AMPA recep- 
tors. Nonetheless, Granger and co-authors’ 
data are intriguing because they show that, at 
least in the context of neurons lacking AMPA 
receptors, several different kinds of glutamate 
receptor can be recruited to synapses and are 
sufficient to support LTP — irrespective of 
their C-tails and presumably regardless of their 
associated proteins and accessory subunits. 

This conclusion turns the mechanistic spot- 
light on the overall structural rearrangements 
that occur in the synapse during LTP — espe- 
cially changes to a structure called the post- 
synaptic density, in which AMPA receptors are 
clustered’ — and how such rearrangements 
might lead to the capture of AMPA receptors. 
Indeed, the abundance of scaffolding proteins, 
such as PSD-95, capable of corralling AMPA 
receptors in the synapse, is regulated by phos- 
phorylation, and such proteins may have an 
underestimated role in synaptic plasticity*°—
the process by which synapses grow stronger 
or weaker depending on their activity level. 

Even if changes in synapse architecture are 
the main drivers for trapping AMPA recep- 
tors during LTP, it is still difficult to imagine 
how this mechanism could operate completely 
independently of the C-tails of the receptors or 
their accessory subunits. To resolve the roles of 
the different GluA subunits and their C-tails in 
LTP, it would be helpful to generate data from 
more subtle genetic experiments in vivo — in 
which, for example, the C-tail of endogenous 
GluA1 is replaced with the C-tail of GluA2, 
without otherwise altering the expression of 
the proteins. 


Morgan Sheng is in the Department
of Neuroscience, Genentech Inc.,
South San Francisco, California 94080, USA.
e-mail: sheng.morgan@gene.com


Two steps back 
ROBERTO MALINOW & RICHARD HUGANIR 


In Granger and co-workers’ experimental
set-up, synapses can recruit any type
of glutamate receptor to mediate LTP — a 
remarkable finding that suggests an incredible 
versatility for synapses. But does such behav- 
iour occur when synapses contain their normal 
complement of proteins? 

Genetically deleting all AMPA receptors, as 
the authors have done in their elegant system, 
may fundamentally change AMPA-receptor 
trafficking compared with that in normal 
synapses. Indeed, GluA1 subunits lacking 
their C-tail could reach synapses only when 
all AMPA receptors were deleted and not in the 
more normal situation, in which other AMPA- 
receptor subunits were present. Our interpre- 
tation of this observation is that the C-tail 
endows GluA1 with a competitive advantage 
to reach the synapse. 

Figure 1 | AMPA receptors at the synapse. At the level of neurons, long-term potentiation (LTP) is mediated
by AMPA receptors, which are transported from intracellular compartments and elsewhere in the cell membrane
to the membrane of the postsynaptic neuron. Surprisingly, Granger et al.¹ report that the cytoplasmic tail
of the GluA1 subunit of AMPA receptors is not required for this trafficking and thus for LTP. Instead, they
propose that any type of glutamate receptor can support LTP, as long as enough of them reach the membrane.
(Diagram labels: neuron, synapse, presynaptic neuron, postsynaptic neuron, cell membrane.)

To address the mechanisms underlying LTP
at normal synapses, more subtle experimental
manipulations may be more informative;
these could include introducing mutations
(‘knock-in’ mutations) into the genes encoding 
GluA1 and GluA2 to alter their C-tail structure 
without affecting their expression level. In fact, 
previous studies”* involving such mutations 
have shown that the C-tail phosphorylation 
sites and protein-interacting domains of these 
subunits are important for several forms of 
synaptic plasticity, including LTP. 

Synapses lacking AMPA receptors may 
behave differently from normal synapses 
because loss of some essential regulatory pro- 
teins makes them less selective. Are AMPA- 
receptor subunits responsible for maintaining 
such crucial proteins at the synapse? It could 
be that synapses without AMPA receptors — 
such as those that are prevalent early in devel- 
opment’ — have lower selectivity for receptors 
during LTP than do more mature synapses con- 
taining AMPA receptors. Alternatively, there 
may be competition for entering the synapse 
between different receptors that have different 
interaction partners at the synapse to control 
their lifetime there. If that is the case, and a competitor
with several such partners, say AMPA
receptors, is removed, then other receptors with 
minimal numbers of partners and interactions 
could enter and remain in the synapse. 

Granger and co-authors’ study, therefore, 
may point to a novel view that synapses are 
sites of competition: how efficiently a recep- 
tor enters the synapse depends not only on its 
structural components, but also on those of its 
competitors. Such factors may determine how 
AMPA receptors get into and out of a synapse 
during conditions of synapse maintenance and 
plasticity. 

But whatever the mechanism involved, the 
present study will focus future research on the 


structural changes that occur at the synapse 
after LTP. Many scaffolding proteins regulate 
the complex structure of the postsynaptic den- 
sity, and, according to Granger et al., structural 
changes may occur that do not require specific 
glutamate receptor types. However, the iden- 
tity of these common scaffolding proteins is 
not obvious, because most such proteins are 
known to be specific for AMPA or kainate 
receptors. In fact, some of the same authors 
have previously shown that AMPA-receptor- 
associated proteins known as TARPs are 
crucial for LTP”. Yet the present study refutes
those data and shows that even kainate recep- 
tors, which do not interact with TARPs, can 
completely rescue LTP in the absence of 
AMPA receptors. 

So, one step forward, two steps back. The 
search for the mechanism underlying LTP is 
well into its third decade, and it looks as if it 
will continue to fascinate and elude neurosci- 
entists for the next decade, too.


Roberto Malinow is in the Center for Neural 
Circuits and Behavior, Division of Biology, 
Neurobiology Section, and the Department 
of Neurosciences, University of California, 
San Diego, La Jolla, California 92093, USA. 
Richard Huganir is in the Department
of Neuroscience, Howard Hughes Medical
Institute, Johns Hopkins University School of
Medicine, Baltimore, Maryland 21205, USA.
e-mails: rmalinow@ucsd.edu; 
rhuganir@jhmi.edu 
Biofuel production
on the margins 


An analysis shows that fuel made from wild, herbaceous vegetation grown on land 
currently unsuitable for cultivating field crops could contribute substantially to 
the United States’ targets for biofuel production. SEE LETTER P.514 


KLAUS BUTTERBACH-BAHL & RALF KIESE 


The governments of more than 35 countries,
including the United States, Brazil
and members of the European Union,
have established policies promoting the production
and use of biofuels¹. This is driven by
an interest in becoming more independent of
fossil-fuel imports and reducing the climate 
footprint of their economies. However, the 
climate benefit of replacing fossil fuels with
biofuels is strongly disputed, because of the
lack of compelling evidence to show that bio- 
fuels are indeed associated with much lower 
greenhouse-gas emissions than fossil fuels 
when the full life cycle of their production and 
use is taken into account. On page 514 of this 
issue, Gelfand et al. report that certain wild,
herbaceous vegetation, growing on ‘marginal’ 
lands currently unsuitable for arable farming, 
can be used as a biofuel crop, and does sub- 
stantially mitigate greenhouse-gas emissions 
50 Years Ago


International Tables for X-ray 
Crystallography. General Editor: 
Dame Kathleen Lonsdale — The 
fifty years which have elapsed 

since the discovery of X-ray 
diffraction by crystals have 
witnessed the development of 
X-ray crystallographic techniques 
as a structure-determining tool of 
unprecedented power and catholicity 
of application. By it the complexities 
of mineral structures have been 
rationalized ... and the elaborate 
architecture of the giant globular 
proteins mapped out. Fortunately, 

it was recognized early that this 
diversity would make discipline 

in the presentation of results, and 
uniformity of nomenclature and 
convention particularly desirable. 
The first attempt to provide such 

an authoritative basis was by 

the Internationale Tabellen zur 
Bestimmung von Kristallstrukturen 
of 1935. In 1946 the International 
Union of Crystallography decided 
on a complete revision and
extension of these tables under 

the general editorship of Dame 
Kathleen Lonsdale. Two volumes ... 
have already appeared; Volume 3, 
dealing with physical and chemical 
tables, represents the completion 

of the 1946 project ... No X-ray 
crystallographical laboratory 
worthy of the name will fail to add 
this magnificently printed and 
luxuriously bound volume to the two 
they already should possess. Struther Arnott 
From Nature 26 January 1963 


100 Years Ago 


It is not at all difficult to measure the 
ionisation produced by the radiation 
reflected by crystals, as indeed Prof. 
Barkla has already suggested. Using 
a sheet of mica and a pencil of a few
millimetres diameter, I find it possible 
to follow with an ionisation chamber 
the movement of the reflected spot 
while the mirror is rotated. W.H. Bragg 
From Nature 23 January 1913 


compared with fossil-fuel use — even rival- 
ling the benefits associated with growing tra- 
ditional biofuel crops, such as maize (corn)*. 

When assessing the potential climate 
benefits of biofuels, it is essential to consider 
the consequences of land-use change and of 
fertilization associated with growing biofuel 
crops** — particularly any changes in the 
carbon stocks of affected ecosystems, and in 
the emissions of nitrous oxide, a potent green- 
house gas produced by soil bacteria. It is also 
crucial to determine whether the growing of 
biofuel crops poses local threats to biodiversity, 
or to water and nutrient cycling.

Moreover, because biofuel feedstocks are 
currently produced mostly on fertile agri- 
cultural land, it has been questioned whether 
useful amounts of biofuels can be produced 
without threatening food production. The 
ensuing conflict of interest has been called the 
“food, energy and environment trilemma”.
To be acceptable to society, therefore, biofuel- 
production strategies must be shown to greatly 
mitigate greenhouse-gas emissions without 
jeopardizing food and animal-feed production 
through competition for land use, and to have 
a minimal effect on the environment. 

Gelfand et al. compared the biofuel yields, 
greenhouse-gas emissions, changes in soil- 
carbon stocks, and energy consumption asso- 
ciated with field operations for six biofuel- 
cropping systems in the midwestern United 
States over a 20-year period. They then used 
these data in a rigorous life-cycle assessment 
of the climate benefits of the different systems. 
Because it is based on long-term data, this is 
the first convincing analysis of the impact of 
biofuel-production systems on global warm- 
ing. By contrast, previous studies relied either 
on modelling or on short-term studies of a 
smaller number of systems. 

The authors show that all the biofuel- 
cropping systems investigated are net sinks of 
atmospheric carbon dioxide if fossil-fuel offset 
credits are included in the analysis. These cred-
its are the sum of all the CO₂ emissions poten-
tially avoided when fossil fuels are replaced
with biofuels, taking into account both the 
production and the combustion of the fossil 
fuels. Surprisingly, the researchers found that
the greenhouse-gas mitigation of wild, peren- 
nial, herbaceous vegetation (Fig. 1) — specifi- 
cally, successional vegetation, which naturally 
regrows in marginal areas such as abandoned, 
low-productivity arable land — was markedly 
higher than that of intentionally grown crops, 
including maize, alfalfa, poplar and a maize- 
soya bean-wheat crop rotation, and that 
energy production was comparable. More- 
over, Gelfand et al. show that moderate levels 
of nitrogen fertilization could further boost 
biofuel yields of the wild vegetation system by 
about 50%, with only a marginal increase in 
nitrous oxide emissions. 


*This article and the paper under discussion were 
published online on 16 January 2013. 
Figure 1 | Wild biofuel crops. Gelfand et al.
report that wild, successional, herbaceous 
vegetation, such as that pictured, is an effective 
biofuel crop that has a greenhouse-gas-mitigation 
capacity rivalling that of traditional biofuel crops. 


A big advantage of such native successional 
systems over other biofuel crops is that they 
can be productive despite the soil and climate 
restrictions typically found in marginal lands. 
This suggests that marginal lands could be a
viable alternative to fertile cropland for biofuel 
production — which would be extremely use-
ful, given the limited land resources.

To explore the regional implications of 
their study, Gelfand et al. used a computa- 
tional approach to identify suitable marginal 
lands for biofuel production across ten states 
of the US Midwest. More specifically, they used 
information from a geographical database in a
biogeochemical model to estimate the effects 
of soil and climate on biofuel yields. 

One constraint on the production of biofuels 
is the need to minimize the energy consumed by 
the collection and transport of the crop. Gelfand 
and colleagues show that, given the distribu- 
tion of marginal lands in the US Midwest, 
optimal biofuel production would be achieved 
if biomass is collected from within a region of 
80-kilometre radius around refineries. Such a 
production strategy could yield approximately 
21 billion litres of ethanol per year from 11 mil- 
lion hectares of marginal land. This is about 25% 
of the target mandated by the US Department 
of Energy’s Biomass Program for cellulosic 
biofuel production in 2022 (cellulosic biofuel 
is that produced from lignocellulose, a major 
constituent of wood and grasses). It equates to 
an expected fossil-fuel offset of roughly 40 tera-
grams of CO₂ equivalents each year (1 teragram
is 10¹² grams) — the same as the CO₂ emissions
from 10 million medium-sized cars, each with 
an annual run of 20,000 km. 
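
The figures in this paragraph can be checked with a few lines of arithmetic. The sketch below is only an illustration, not part of Gelfand and colleagues' analysis: it takes the quoted numbers at face value and derives the ethanol yield per hectare and the per-kilometre car emissions implied by the comparison. The constant and function names are assumptions introduced here.

```python
# Rough consistency check of the figures quoted in the text.
# All names are illustrative; the values are the rounded numbers from the article.

ETHANOL_LITRES_PER_YEAR = 21e9    # approximately 21 billion litres of ethanol
MARGINAL_LAND_HECTARES = 11e6     # 11 million hectares of marginal land
OFFSET_TG_CO2E_PER_YEAR = 40.0    # roughly 40 teragrams of CO2 equivalents
GRAMS_PER_TERAGRAM = 1e12         # 1 teragram is 10^12 grams
NUMBER_OF_CARS = 10e6             # 10 million medium-sized cars
KM_PER_CAR_PER_YEAR = 20_000.0    # annual run of each car


def ethanol_yield_litres_per_hectare() -> float:
    """Average ethanol yield implied by the quoted totals."""
    return ETHANOL_LITRES_PER_YEAR / MARGINAL_LAND_HECTARES


def implied_car_emissions_g_per_km() -> float:
    """Per-kilometre CO2 emissions implied by the car comparison."""
    total_grams = OFFSET_TG_CO2E_PER_YEAR * GRAMS_PER_TERAGRAM
    total_km = NUMBER_OF_CARS * KM_PER_CAR_PER_YEAR
    return total_grams / total_km


if __name__ == "__main__":
    print(f"{ethanol_yield_litres_per_hectare():.0f} litres per hectare per year")
    print(f"{implied_car_emissions_g_per_km():.0f} g CO2 per km")
```

Under these rounded inputs the comparison corresponds to a car emitting about 200 grams of CO₂ per kilometre, and the yield works out at a little over 1,900 litres of ethanol per hectare per year.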

So would a native successional biofuel 
crop be all good? Perhaps not. Gelfand and 


G. WP. ROBERTSON, MICHIGAN STATE UNIV. W. K. KELLOGG BIOLOGICAL STATION 


co-workers’ study does not explicitly answer 
the question of whether all the marginal lands 
identified as suitable for biofuel production 
could be used without harming biodiversity 
and the environment. Moreover, land that is 
fallow today might be needed in the future for 
agricultural production, to offset the demands 
of the world’s growing population. 

Another question raised by the study 
concerns greenhouse-gas mitigation: for the 
biofuel-cropping systems under consideration, 
the authors found that, apart from fossil-fuel 
offset credits, increases in soil-carbon stocks 
are the major driver of climate benefits. But 
the rate of increase of soil-carbon stocks will 
slow down with time, so that the stocks reach
an equilibrium level within a few decades. It
therefore seems that comprehensive assess-
ments of the long-term climate impacts of
biofuels will require the quantification of
spatially and temporally explicit soil-carbon
sequestration potentials.


Klaus Butterbach-Bahl and Ralf Kiese are


SOLAR PHYSICS


Towards ever smaller
length scales


Determining the real scale of structures in the Sun’s corona has proved difficult 
because of limited spatial resolution. Now high-resolution imaging has allowed 
dynamic structures on scales of 150 kilometres to be observed. SEE LETTER P.501 


PETER CARGILL 


The origin of the Sun’s outer atmosphere,
the corona, is a long-standing scientific 
problem of great interest and complex- 
ity. Why does a star with a surface 
temperature of roughly 5,700 kel- 
vin have an outer atmosphere with 
temperatures in excess of 1 mega- 
kelvin, and why does the corona 
exhibit phenomena such as flares? 
The answer lies in the energy con- 
tained in the Sun’s magnetic field, 
which fills the corona, as inferred 
from coronal images at extreme 
ultraviolet and X-ray wavelengths. 
How the magnetic energy is dis- 
sipated in the corona and sustains 
its temperature is controversial, 
but comes down to a determina- 
tion of the spatial scales of coro- 
nal structures. On page 501 of 
this issue, Cirtain et al. identify
dynamic structures on scales of 
150 kilometres, which represents 
a major constraint that theories 
must now confront. 

Before 2012, the best spatial 
resolution of the solar corona was 
obtained by NASA's Atmospheric 
Imaging Assembly (AIA) on the 
Solar Dynamics Observatory 
spacecraft, which was launched 
in 2010. The instrument resolves 


scales of about 900 km and looks at several 
wavelength ranges corresponding to different 
temperatures. However, images of the visible 
solar surface at a resolution of 100 km show 
distinct magnetic and plasma structures, 


Figure 1 | Small structure in the corona. The image is a sub-field of
the entire field of view observed by the High-resolution Coronal Imager
(Hi-C) and analysed by Cirtain and colleagues. It shows the solar
corona at a temperature of roughly 1.5 megakelvin over a dimension of
154.6 × 123.7 arcseconds, or 112,000 × 90,000 km. The strands running from
top left to lower right are believed to outline the magnetic field in the corona, 
as are the other structures in the image. The remarkably fine structure is 
visible everywhere and constitutes the major advance achieved with the Hi-C. 
(Image prepared by J. Cirtain and A. Winebarger.) 
and so the question arises as to whether
structures with these scales are also present in 
the corona. 

In their study, Cirtain et al. used the High- 
resolution Coronal Imager (Hi-C), a new 
extreme-ultraviolet instrument that was 
launched on a rocket on 11 July 2012 and 
obtained roughly 5 minutes of data before re- 
entering Earth’s atmosphere. The instrument 
looks at coronal plasma with temperatures 
of around 1.5 MK, and is capable of spatial 
resolution at least five times better than the 
AIA: the Hi-C can resolve scales on the order 
of 150 km. To place this in context, it took 
more than 30 years to improve the spatial 
resolution from the few thousand kilometres 
obtained by instruments on NASA’s Skylab 
observatory to that obtained with the AIA. 
The Hi-C instrumentation performed up to 
expectation, and images of the 
Sun show unambiguous structure 
at the desired resolution — a huge 
achievement. 
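
As a rough cross-check of the resolution figures, the sketch below compares the quoted AIA and Hi-C scales and converts them to arcseconds using the field dimensions given in the Figure 1 caption. It is an illustrative calculation under those quoted values only, not the instrument team's own calibration; the names are assumptions introduced here.

```python
# Illustrative check of the quoted resolution figures.
# Values are taken from the article text and the Figure 1 caption.

AIA_RESOLUTION_KM = 900.0            # scales resolved by the AIA
HIC_RESOLUTION_KM = 150.0            # scales resolved by the Hi-C
FIELD_ARCSEC = (154.6, 123.7)        # sub-field size in arcseconds
FIELD_KM = (112_000.0, 90_000.0)     # the same sub-field in kilometres


def improvement_factor() -> float:
    """How many times finer the Hi-C scale is than the AIA scale."""
    return AIA_RESOLUTION_KM / HIC_RESOLUTION_KM


def km_per_arcsecond() -> float:
    """Approximate plate scale on the Sun, averaged over both image axes."""
    scales = [km / arcsec for km, arcsec in zip(FIELD_KM, FIELD_ARCSEC)]
    return sum(scales) / len(scales)


def resolution_in_arcseconds(resolution_km: float) -> float:
    """Convert a length scale on the Sun to an angular scale."""
    return resolution_km / km_per_arcsecond()


if __name__ == "__main__":
    print(f"improvement factor: {improvement_factor():.1f}")
    print(f"plate scale: {km_per_arcsecond():.0f} km per arcsecond")
    print(f"Hi-C: {resolution_in_arcseconds(HIC_RESOLUTION_KM):.2f} arcseconds")
    print(f"AIA: {resolution_in_arcseconds(AIA_RESOLUTION_KM):.2f} arcseconds")
```

With these inputs the ratio is a factor of six, consistent with "at least five times better", and 150 km corresponds to roughly a fifth of an arcsecond.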

A striking feature of Cirtain and 
colleagues’ results is the dynamic 
structures visible at the limit of 
resolution, clearly evident by com- 
paring images from the Hi-C and 
AIA in the paper’s Supplemen- 
tary Videos 1 and 2 (ref. 1). (The 
reader should also look at other 
aspects of the videos to note how 
much else is happening on these 
small scales, as is also evident in 
Fig. 1.) The dynamic behaviour of 
the observed structures is inter- 
preted as evidence for ‘magnetic 
braiding’, an effect in which small
bundles of magnetic field become 
wrapped around each other owing 
to plasma motions at the solar sur-
face. Whether this is in fact the
case is unclear, but there seems 
little doubt that magnetic-field 
dissipation on a fundamen- 
tal scale is seen, with different 
field elements interacting with 
one another through magnetic 
reconnection, a process that changes the
magnetic-field topology through dissipation 
of electric currents. To me, the Hi-C images 
are reminiscent of computational models of 
the kink instability, a process known from 
plasma physics that is also thought to occur 
in the corona. Although such processes have
long been conjectured, prima facie evidence 
for coronal reconnection, as found by Hi-C, is 
an important result. 

A more general point concerns the very 
presence of structures at this resolution. 
There has long been a debate about when 
coronal structures are resolved; that is, what 
is an elemental structure? In the past, some 
have stated that structures seen by earlier solar 
missions are resolved, or ‘monolithic’. Others
have argued from theory and interpretation of 
data that scales on the order of 100 km were
to be expected, and that such high resolution 
was needed. Indirect evidence from the AIA 
had also begun to point the way to such scales,
but the Hi-C results show that any debate 
on the structure of the corona now needs to 
address scales of 100-200 km or smaller, as can 
be seen in Figure 1. 

Clearly, even in 5 minutes of observations 
there is a wealth of data that need to be ana- 
lysed. The next stage is securing Hi-C, or an 
instrument with similar or improved per- 
formance, on an orbiting spacecraft. This 
spacecraft must also carry a modern extreme-
ultraviolet spectrometer — both to comple-
ment Hi-C and to provide fundamental plasma
measurements of density, temperature, veloci- 
ties and small-scale turbulence — as well as an 
instrument capable of measuring signatures 
of energetic particles, which are known to be 
a significant product of the magnetic-recon-
nection process. Only with such complete
instrumentation can a proper understanding 
of coronal structures be attained. 

Has Hi-C really resolved the corona? To do 
this will require observation of a wider range 
of solar conditions than is feasible in a short 
rocket flight, and one should not bet against 
the existence of further fine structure within 
the scales detected by Hi-C. But for those who 
have wanted to see observations on such scales 
for decades, there is a feeling that things are 
getting interesting, and quantitative tests of 
competing theoretical ideas can be under- 
taken, as is evident from the above discussion 
of this short data set.
Gritting their teeth


A comparison of the wearing effect of plant-derived silica and desert dust on 
tooth enamel suggests that extreme wear on teeth might not be caused by food. 
The findings may change some thoughts about the diets of human ancestors. 


BERNARD WOOD 


Chewing is much like using a pestle and
mortar, but upside down. Your jaw
muscles supply the power and move
your lower teeth (the pestle) up and across 
your upper teeth (the mortar). As long as any 
food caught between them is not especially 
hard or tough, it will be broken into smaller 
pieces and the teeth should remain intact. But, 
unlike the granite or marble pestle and mortar 
in your kitchen, teeth are gradually worn 
down. It has been widely assumed that inter- 
actions with food are the cause of this destruc- 
tive damage. But, writing in the Journal of the 
Royal Society Interface, Lucas et al. suggest that
the culprit may not be the food we chew, but
the dust or grit we ingest along with it. 

When teeth are newly erupted into the jaw, 
the enamel surface is relatively pristine. How- 
ever, even though enamel is the hardest tissue 
in the body, once teeth have been used to chew 
on food, tiny grooves, scratches and pits soon 
pockmark the enamel. Living animals with 
different diets produce different patterns 
of dental microwear, so it was natural to
assume that these patterns can be used to infer 
the types of food eaten by our ancestors and 
close evolutionary relatives. However, a con-
sideration of the physical processes that cause
wear to hard surfaces calls some of these 
assumptions into question. 


Figure 1 | Signs of wear. The fossilized palate and maxillary teeth, seen from below, of OH 5, the type 
specimen of Paranthropus boisei, an archaic hominin (human ancestors and close relatives) that lived 
approximately 2 million years ago. The species is referred to as a hypermegadont, meaning that it has 
large, broad cheek teeth and small front teeth. Although OH 5 was not yet an adult, the enamel on the 
grinding faces of both first molars (M1), the premolars (P4 and P3) and the canines (C) has been worn 
down to expose the softer dentine in the pulp cavities of the teeth (examples of wear are indicated by 
arrows). Lucas et al. suggest that only grit, not food, is hard enough to have removed the exceptionally
thick enamel covering of these teeth so quickly.
Materials scientists define wear as what
happens when an object loses some of its 
volume. If you dance in high heels on your 
neighbour’s softwood floor, you will annoy 
them by marking it, but marking merely 
deforms the surface of the floor — technically 
it is not worn. But if you were to dance in golf 
shoes, you will abrade the floor and eventually 
wear it away, because abrasion results in loss of 
volume. It seems unlikely that any food built 
primarily from either cellulose (deriving from 
plants) or collagen (from animal soft tissues) is 
going to abrade enamel, yet these are the foods 
we have in mind when we think about diet. 
Food acids can eventually dissolve enamel, 
but this is only a problem in some modern 
human populations. So what is it that wears 
teeth down? A basic understanding of wear in 
industrial processes suggests that it is probably 
something really hard, such as silica. 

In the context of chewing, resistance to 
indentation — or hardness — is the crucial var- 
iable. In their experiments, Lucas et al. used an 
orang-utan molar tooth as a proxy for the thick-
enamelled fossil teeth of our ancestors and close
relatives. They measured its hardness (which 
ranged between 4.08 and 5.72 gigapascals), and 
compared it with the hardness of two sorts of 
phytolith, deriving from squash (which had a 
hardness range of 0.43-1.74 GPa) and grass 
(1.35-4.24 GPa), and of Kuwaiti desert dust 
(10.1-14.1 GPa). Phytoliths are silica particles 
produced by plants as a defence mechanism, 
but they are non-crystalline and, as can be seen 
from these hardness measurements, they are 
softer than both quartz grit and enamel. 
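
The hardness ranges above can be set side by side in a short sketch. This is only an illustration of the comparison described in the text, using the quoted ranges; it is not the procedure used by Lucas et al., whose conclusions rest on the sliding experiments described next. The type and function names are assumptions introduced here.

```python
# Compare the quoted hardness ranges (in gigapascals) of each particle
# type with that of orang-utan enamel. Names are illustrative only.
from typing import NamedTuple


class HardnessRange(NamedTuple):
    low_gpa: float
    high_gpa: float


ENAMEL = HardnessRange(4.08, 5.72)

PARTICLES = {
    "squash phytolith": HardnessRange(0.43, 1.74),
    "grass phytolith": HardnessRange(1.35, 4.24),
    "Kuwaiti desert dust": HardnessRange(10.1, 14.1),
}


def compare_with(particle: HardnessRange, target: HardnessRange = ENAMEL) -> str:
    """Classify a particle's hardness range relative to the target's range."""
    if particle.low_gpa > target.high_gpa:
        return "harder across the whole range"
    if particle.high_gpa < target.low_gpa:
        return "softer across the whole range"
    return "ranges overlap"


if __name__ == "__main__":
    for name, hardness in PARTICLES.items():
        print(f"{name}: {compare_with(hardness)}")
```

On these numbers the desert dust is harder than enamel throughout, the squash phytoliths are softer throughout, and the hardest grass phytoliths (4.24 GPa) only just reach the softest enamel measurement (4.08 GPa).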

The authors mounted individual micro- 
scopic particles of the phytoliths, quartz dust 
and chips of enamel from a tooth on a nanoin-
denter (a device used for measuring wear
effects on materials at the nanoscale) and then 
slid these across the surface of tooth enamel 
at a known force and angle. They found that 
neither the phytoliths nor the enamel chips 
resulted in any loss of volume of the tooth; the 
particles marked it, but they did not abrade it. 
Only the quartz dust resulted in the volume 
loss that would eventually accumulate to pro- 
duce the type of wear on a tooth that is visible 
to the naked eye (Fig. 1). 

The authors also found that the enamel was 
abraded only if the quartz particles had the
right angle of ‘attack’. The critical attack angle
is set by the toughness of the tooth tissue; it is 
low for enamel, but higher for dentine, which is 
softer but tougher. Without the right geometry, 
even very hard particles will not abrade enamel. 

All methods for reconstructing evolutionary 
history begin with observations about pattern, 
but what really matters are the processes that 
generate the pattern. Interpretations based on 
dental microwear are at the stage of shifting 
from questions about the patterns to more 
fundamental questions about the basic phys- 
ics of the processes that determine them, and 
these findings suggest that dental wear results
from the dust and grit that is ingested along
with food, rather than from factors intrinsic to
the food. 

So what do these results mean for attempts to 
reconstruct the diet of our ancestors and their 
close relatives? As well as dental microwear, 
the shape and size of the skull and teeth can be 
used to generate hypotheses about the types of 
food (hard versus soft, tough versus weak) that 
these primates were adapted to eat. The chemi- 
cals that accumulated in their bones and teeth 
also reflect the types of vegetation (dry tropi-
cal grass versus leaves) consumed. And dental
macrowear can be used to interpret how quickly 
the enamel covering has been removed from the 
teeth. In many cases, these lines of evidence lead 
to similar conclusions, but in one of our close 
evolutionary relatives, Paranthropus boisei, 
the microwear on the teeth is not consistent
with the extreme dental macrowear seen in this 
taxon (Fig. 1). Lucas and colleagues suggest that 
this extreme macrowear is not caused by diet, 
but instead is most likely to be the result of these
creatures literally ‘gritting their teeth’.
The to and fro
of tumour spread 


Two studies shed light on the role of cellular transitions between the epithelial 
and mesenchymal states during cancer metastasis, and provide food for thought 
as to which cellular processes should be targeted in cancer treatment. 


BRYCE J. W. VAN DENDEREN 
& ERIK W. THOMPSON 


The spread of cells from the primary
site of a solid tumour to distant sites
remains the major cause of disease
and death associated with these cancers. For 
tumour cells to spread, or metastasize, they 
must modify their ‘anchored’ state and detach 
from their neighbouring cells; migrate through 
tissues into the blood and lymph systems; sur- 
vive in these circulation systems; and then 
leave the vessels at an appropriate site to form 
another tumour. Many of these events are
favoured by conversions between two cellu- 
lar states — the epithelial and mesenchymal 
phenotypes. But the role of these transitions in 
cancer metastasis is controversial. Writing in 
Cancer Cell, Tsai et al. and Ocaña et al. help
to clarify this issue. 

Most cancers in adults are carcinomas, 
which arise from abnormal growth of epithe- 
lial cells that line the surfaces and cavities of 
organs such as the breasts, colon, lungs and 
liver. Epithelial cells are fully differentiated 
and non-mobile, whereas mesenchymal cells 
migrate easily. In the developing embryo, a 
process called the epithelial-mesenchymal 


transition (EMT) allows epithelial cells to 
take on mesenchymal-cell characteristics and 
move sites, after which they revert back to an 
epithelial phenotype through the mesenchy- 
mal-epithelial transition (MET), and continue 
to form body structures. EMT is also involved 
in metastasis, but whether it is an absolute 
requirement is controversial, in part because 
metastatic tumours display an epithelial 
phenotype and lack mesenchymal markers.
We have coined the term epithelial-mesen- 
chymal plasticity for the dynamic interchange 
between epithelial and mesenchymal pheno-
types. Ten years ago, it was hypothesized that
tumour cells that had undergone EMT and dis- 
seminated would then go through MET to suc- 
cessfully colonize a secondary site and form a 
metastatic tumour; and preliminary evidence
for MET has been reported in cancers of the 
colon, breast, bladder and prostate. Although
the abundance of studies on EMT still dwarfs
those on MET, MET is emerging as a crucial and
possibly rate-limiting step in metastasis, and 
this is supported by the two new papers. 
Previous work has shown that the EMT- 
inducing transcription-factor protein Twist1 
is essential for spontaneous metastasis in a 
mouse model of mammary cancer. Tsai and
Figure 1 | Cellular transitions in cancer metastasis. Dissemination of cells
from a primary solid tumour is facilitated by the epithelial-mesenchymal
transition (EMT). This process allows epithelial tumour cells (carcinoma
cells), which are fully differentiated and non-mobile, to acquire the more
invasive characteristics of mesenchymal cells. Tsai et al. show that EMT is
induced by a transcription factor called Twist1, and that Twist1 expression 
also increases the number of metastable cells, which express both epithelial 


co-workers have built on these findings using 
a mouse model of squamous-cell carcinoma 
in which Twist1 expression can be turned
on and off. The authors found that Twist1 
induces EMT and tumour-cell transit into 
the bloodstream (Fig. 1), but that metasta- 
ses did not form if the cancer cells could not 
switch Twist1 off after disseminating. This 
suggests that halting EMT, and thereby allow- 
ing MET, is required for completion of the 
metastatic process. 

Ocaña et al. have characterized a new EMT-
inducing transcription factor, Prrx1, that they
show drives EMT during embryonic develop- 
ment in chicks. The authors also show that, 
similarly to other embryonic EMT-driving 
factors, Prrx1 seems to play a part in the inva- 
sive, migratory phenotype of breast carcinoma 
cells undergoing metastasis. Moreover, the 
authors found that forced, continuous expres- 
sion of Prrx1 blocked the capacity of other- 
wise metastasis-competent cells to produce 
metastatic tumours, and that Prrx1 suppres- 
sion is needed for MET to proceed — con- 
sistent with Tsai and colleagues’ observations 
for Twist1 (Fig. 1).

A notable advance in our appreciation of the 
role that EMT may have in solid-tumour biol-
ogy came from observations that ‘breast cancer
stem cells’ (cells isolated from clinical samples 
that have particularly strong cancerous prop- 
erties and the ability to initiate new tumours 
from small numbers of cells) exhibit a mesen-
chymal profile. Conversely, both normal and
cancerous mammary epithelial cells that are 
induced to undergo EMT become more stem-
cell-like. This relationship was further refined
when cells with progenitor-cell properties — 
those committed more towards differentiation 
than the mesenchymal-like stem cells — were 
seen to have higher cancerous potential.

Adding to this picture, Ocaña et al. show that
the stem-cell-like features of carcinoma cells 
can be segregated from EMT by manipulating 
the expression of Prrx1. Remarkably, they
show that abrogation of Prrx1 in carcinoma- 
derived cells (called BT-549 cells) that would 
not normally form tumours when injected into 
a mouse makes them both tumorigenic and 
metastatic. Moreover, they show that reducing 
Prrx1 levels in cancer cells causes a reduction 
in EMT, but a gain of stem-cell activities. These
findings are consistent with recent reports 
that pluripotency (the ability to differentiate 
into various cell types), which is associated 
with the epithelial phenotype more than the 
mesenchymal phenotype, is a major driver of 
both cancerous and metastatic potential. It
is thus implicit that exhibiting the enhanced 
malignant properties associated with the mes- 
enchymal stem-cell state requires an ability to 
spontaneously progress towards an epithelial 
state with pluripotent characteristics. 

An additional twist to the tale comes from 
the fact that some tumour cells simultane-
ously show mesenchymal and epithelial char-
acteristics. Also called the intermediate,
or metastable, phenotype, it has long been 
recognized that co-expression of both pheno- 
types may offer cells a form of pluripotency, 
allowing them to dynamically adjust to the 
circumstances they encounter. Such plasti- 
city seems to be key for the full spectrum of 
metastatic competence, because cells that are 
rigidly locked into one state or another are 
less capable of metastasis or even of primary- 
tumour growth, as seems to be the case with 
BT-549 cells. Furthermore, other studies
have shown that cooperation between mesen- 
chymal and epithelial variants of some cancer 
cells can allow the epithelial variants to escape 
their tumour site and metastasize. Thus, the 
combined evidence from these studies and 
the present papers suggests that both mesen- 
chymal and epithelial phenotypes are required 
for metastatic competence, and that cancer 
cells must have sufficient plasticity between 
these phenotypes to spread. 
and mesenchymal characteristics, in the bloodstream of mice. However,
the authors found that sustained Twist1 expression prevents the formation
of secondary tumours, possibly by preventing the cells from undergoing
the mesenchymal-epithelial transition (MET) that is needed for them to
revert to the epithelial state and form metastases. Ocaña et al. characterize
another EMT-inducing transcription factor, Prrx1, and show that MET can be
achieved with Prrx1 suppression, even during continued Twist1 expression.


The reports by Tsai et al. and Ocaña et al.
leave little doubt as to the importance of MET 
and/or the epithelial phenotype for carcinoma 
metastasis. As stated by both sets of authors, 
this introduces the question of whether the 
therapies that many laboratories are chasing — 
treatments designed to inhibit cancer cells in 
the mesenchymal state — might in fact have a 
stimulatory effect on established metastases, or 
even activate dormant cancer cells. Models and 
experimental designs, such as those used for 
these studies, will be crucial to resolving this 
question, and they herald an exciting chapter 
in our understanding of metastasis.
Eemian interglacial reconstructed from a
Greenland folded ice core 


NEEM community members* 


Efforts to extract a Greenland ice core with a complete record of the Eemian interglacial (130,000 to 115,000 years ago) 
have until now been unsuccessful. The response of the Greenland ice sheet to the warmer-than-present climate of the 
Eemian has thus remained unclear. Here we present the new North Greenland Eemian Ice Drilling (“NEEM”) ice core and 
show only a modest ice-sheet response to the strong warming in the early Eemian. We reconstructed the Eemian record 
from folded ice using globally homogeneous parameters known from dated Greenland and Antarctic ice-core records. 
On the basis of water stable isotopes, NEEM surface temperatures after the onset of the Eemian (126,000 years ago) 
peaked at 8 ± 4 degrees Celsius above the mean of the past millennium, followed by a gradual cooling that was probably
driven by the decreasing summer insolation. Between 128,000 and 122,000 years ago, the thickness of the northwest
Greenland ice sheet decreased by 400 ± 250 metres, reaching surface elevations 122,000 years ago of 130 ± 300 metres
lower than the present. Extensive surface melt occurred at the NEEM site during the Eemian, a phenomenon witnessed
when melt layers formed again at NEEM during the exceptional heat of July 2012. With additional warming, surface melt
might become more common in the future.


A 2,540-m-long ice core was drilled during 2008-12 through the ice at
the NEEM site, Greenland (77.45° N, 51.06° W, surface elevation
2,450 m, mean annual temperature −29 °C, accumulation 0.22 m
ice equivalent per year). The top 1,419 m is from the current interglacial,
the Holocene, and together with the glacial ice below it can be
matched to the NGRIP GICC05 extended timescale down to
2,206.7 m (108 thousand years before present, referred to as kyr BP,
where ‘present’ is defined as AD 1950). Below this, the ice is disturbed
and folded, but it contains zones with relatively high stable isotope
values of H₂O (δ¹⁸O_ice, a proxy for condensation temperature), indicating
that it stems from the last interglacial, the Eemian (130-115 kyr BP; Fig. 1).
Near bedrock, low δ¹⁸O_ice values suggest that the ice layers are
most probably from the glacial period before the Eemian. The lowest
5 m of the ice core contain accreted ice, with dark layers 1-20 cm thick
that contain high concentrations of basal material. In this study,
information from the Eemian period of the NEEM ice core will be
used to constrain the surface elevation of the ice sheet and the temperature
of this warm climate period. Measurements of δ¹⁸O_ice have
been made, and air bubbles trapped within the ice have yielded concentrations
of CH₄ and N₂O, stable isotope values δ¹⁵N of N₂ and δ¹⁸O
of O₂ (δ¹⁸O_atm), and total air content (see details in Supplementary
Information). In addition, the rheology of the ice, radio echo
sounding (RES) images, and surface and ice temperatures
are used in the interpretation.


Reconstruction of the climate record 


Stratigraphic disruptions are identified from discontinuities of δ¹⁸O_ice
at depths of 2,209.60 m, 2,262.15 m, 2,364.45 m and 2,432.19 m.
Corresponding shifts in gas concentrations are found at these depths,
so the bubble enclosure process has not caused the depth offset
expected for stratigraphically undisturbed ice (Fig. 1). Possible discontinuities
below 2,432.19 m have not been investigated. The records of
CH₄ concentrations and δ¹⁸O_atm are also disturbed and are not identical
with the globally homogeneous signals documented at the
nearby NGRIP ice core (which contains undisturbed stratigraphy
back to 123 kyr BP) or with the EDML Antarctic ice core (which
reaches back more than 135 kyr BP). Measurements of N₂O, δ¹⁵N
and air content in the NEEM ice below 2,200 m further confirm these
discontinuities.

NEEM data reveal spikes in CH₄ and N₂O records between depths
of 2,370 m and 2,418 m, which are too rapid to be explained by climatic
variability and coincide with lower air content in the ice
(Figs 1b, 2, 4, shaded areas, and Supplementary Fig. 6). These characteristics
point to surface melting or wet surface conditions. Indeed,
surface melting or percolating rain reduces the firn air content and
allows in situ production of CH₄ and N₂O. This hypothesis is supported
by results in the near-surface ice from the warmer south
Greenland Dye3 ice core (Supplementary Fig. 9, mean annual temperature
−21 °C) and by measurements of noble gases over the
NEEM spikes, which also support the hypothesis of melting surface
layers (Supplementary Fig. 8). The lack of parallel variability in
δ¹⁸O_ice, δ¹⁵N or δ¹⁸O_atm suggests that these parameters are uninfluenced
by surface melting. The spikes occur in the warmest interval
at the depositional location indicated by water isotopes (δ¹⁸O_ice >
−33‰). The mean water isotope value over the past millennium is
−33.6‰ at NEEM, and very few melt layers are found in the ice core
before 1995. During our NEEM field campaigns (2007-12), the mean
surface air temperature in July reached −5.4 °C (with annual mean
values of δ¹⁸O_snow > −33‰; ref. 8), and studies with ice cores and
snow pits show that episodic melt events occurred. Over the period
12-15 July 2012, an exceptional heat wave produced significant
surface melt over 97% of the Greenland ice sheet, leaving a strong
fingerprint at the NEEM site in the form of melt layers 5-6 cm thick
at 50-70 cm below the surface.

For depths of 2,201.10-2,432.19 m, the NEEM records of δ¹⁸O_atm
and the subset of CH₄ values not corrupted by surface melting are
matched with globally homogeneous signals of these values observed
from other ice cores (Supplementary Figs 4 and 5). For younger
ice, the nearby NGRIP δ¹⁸O_ice record is used as a reference temperature
target for synchronization by assuming simultaneous abrupt


*Lists of participants and their affiliations appear at the end of the paper. 
Figure 1 | Observed NEEM records. a, A surface elevation map (scale to the
right) of the position of the NEEM camp, including coloured upstream 
depositional positions (upstream time marker scale below) of the ice found in 
the NEEM ice core. The deep ice-core sites NGRIP, NEEM and Camp Century 
and the tracks of the RES images shown in Fig. 3f and g (red lines) are shown on 
the map. b, The observed records of δ¹⁸O_ice, δ¹⁸O_atm (reversed scale), CH₄, N₂O,
δ¹⁵N and air content (reversed scale) from 2,162 m and deeper are plotted on
the NEEM depth scale (bottom x axis). The air content measurements from 
LGGE and from University of Bern (thin curve, small symbols) are both 
included. Zones 0-6 (blue, 0-2,206.5 m; cyan, 2,206.5-2,209.6 m; green, 
2,209.6-2,262.2 m; orange, 2,262.2-2,364.5 m; red, 2,364.5-2,418.0 m; dark red, 
2,418.0-2,432.2 m; and brown, 2,432.2-2,537 m) represent the sections of the 
NEEM ice-core records with symbols marking the start (diamond) and end 
(square) of each zone. There is no discontinuity between zones 4 and 5 but 
spikes of CH₄, N₂O and air content occur in zone 4 (shaded grey). For
comparison, the NGRIP data are plotted as light grey curves on the NGRIP 
depth scale on top of the plot. The NEEM and NGRIP depth scales are
synchronized between 2,162 and 2,207.6 m NEEM depths.
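
The zone boundaries listed in the Figure 1 caption can be treated as a simple lookup table. The sketch below is illustrative only: the names `ZONE_TOPS`, `ZONE_COLOURS` and `zone_for_depth` are hypothetical, and assigning a depth that falls exactly on a boundary to the deeper zone is an assumption, not something the caption specifies.

```python
from bisect import bisect_right

# Upper boundaries of zones 1-6 (metres, NEEM depth scale), as listed in
# the Figure 1 caption. Zone 0 runs from the surface to the first boundary.
ZONE_TOPS = [2206.5, 2209.6, 2262.2, 2364.5, 2418.0, 2432.2]
ZONE_COLOURS = ["blue", "cyan", "green", "orange", "red", "dark red", "brown"]
MAX_DEPTH = 2537.0  # lower limit of zone 6 given in the caption


def zone_for_depth(depth_m):
    """Return (zone number, colour) for a NEEM depth in metres.

    Assumption: a depth exactly on a boundary belongs to the deeper zone.
    """
    if not 0.0 <= depth_m <= MAX_DEPTH:
        raise ValueError(f"depth {depth_m} m is outside 0-{MAX_DEPTH} m")
    zone = bisect_right(ZONE_TOPS, depth_m)
    return zone, ZONE_COLOURS[zone]


if __name__ == "__main__":
    for depth in (2000.0, 2400.0, 2500.0):
        print(depth, zone_for_depth(depth))
```

For example, a depth of 2,400 m falls in zone 4 (red), the zone in which the caption places the CH₄, N₂O and air content spikes.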
Figure 2 | Reconstructed records from the NEEM ice core. The
reconstructed records of δ¹⁸O_ice, δ¹⁸O_atm (reversed scale), CH₄, N₂O,
δ¹⁵N and air content (reversed scale) on the EDML1 timescale. Zone 1
(cyan) is seen to contain a fold while zone 2 (green) and zone 3 (orange)
are reversed and cover identical time periods. NGRIP records (light grey)
and EDML records (dark grey) are included where they are available.
The CH₄, N₂O and air content records contain spikes from 127 to 118.3 kyr BP
(shaded grey).


climate changes at the transitions between stadials and interstadials
(Supplementary Fig. 3).

In Fig. 2, the reconstructed NEEM records are shown on the
EDML1 timescale and compared to the NGRIP (light grey) and
EDML (dark grey) records. Zones 1 to 5, identified in Fig. 1, map
onto the timescale as coherent pieces. Zone 1 is folded such that the
records are mirrored and repeated, zone 2 and zone 3 cover identical
time periods, both inverted, while zone 4 (with melt-related spikes)
and zone 5 are undisturbed and contain the major part of the ice from
the Eemian (128.5-115 kyr BP) (Figs 2 and 3a, b). It cannot be ruled
out that small disruptions or folds are present within the individual
zones. The reconstructed records, however, show no unexpected discontinuities
within zones 1-5 in any of the other measured parameters,
such as δ¹⁵N (Fig. 2), dust or electrical properties. As the timescale is
transferred from the EDML ice core by matching it to the globally
homogeneous signals, small undetectable disruptions will not influence
the conclusions based on the parameters presented here. The
reconstruction is unambiguous and no other solution exists to match
the NEEM δ¹⁸O_atm and the uncorrupted CH₄ values simultaneously
to the undisturbed EDML and NGRIP records. No ice from 114-108 kyr BP
can be found in the NEEM ice core, while the ice layer
from 118-115 kyr BP is found three times (zones 2, 3 and 4), two of
which are inverted.

Northern Hemisphere temperatures are known to vary in parallel
with the atmospheric CH₄ concentration, which is seen to increase
abruptly into the warm Eemian period at 128 kyr BP at EDML. This is
evidence that no substantial warm phases of the NEEM Eemian
record are missing before 128.5 kyr BP. Ice from below 2,450 m (zone
6) appears to be too disturbed to reconstruct an age scale based on
the data available at present. The very low (glacial) δ¹⁸O_ice values
(−44.9‰ at 2,476 m) are first found again in the record above
2,011 m. The ice layering at the NEEM site is undisturbed above
2,200 m (Fig. 3g), so the section below 2,476 m is believed to contain
ice older than 128.5 kyr BP from Marine Isotopic Stage (MIS) 6, and we
may speculate that the increased levels of δ¹⁸O_ice, CH₄ and N₂O at
2,480 m might be from the MIS-7 interglacial.
Figure 3 | Disturbances of the deep NEEM ice. a, The reconstructed NEEM
water stable isotope record (black curve) including NGRIP data between 114- 
108 kyr BP (grey) on the EDML1 timescale that has been colour coded 
according to age. Below 2,432.19 m (grey shading) no dating has been 
attempted. b, The colour-coded data plotted with the original data on the 
NEEM depth scale visualizing the discontinuities and the reversed sections. 

c, d, c-Axis orientations on Schmidt diagrams for depths 2,203 m, 2,308 m and 
2,395 m (c) and polarized light pictures of ice thin sections from the same three 
depths (d) demonstrate the very different ice rheology of the glacial and 
interglacial ice, leading to different flow properties. e, Cartoon of a possible 
history of the formation of two overturned folds at NEEM inspired by the 
observed undulations in the RES images (f and g). f, g, Center for Remote 
Sensing of Ice Sheets (CReSIS) ice-penetrating RES images from NASA 
Operation IceBridge campaign 2011. f, A 52-km-long line to the west that 
illustrates that overturning folds do occur (MCORDS: 20110329_02_020); g, a 
48-km-long line crossing the deep drill site (MCORDS 20110329_02_028). The 
NEEM site is marked on the image, and two dated horizons are marked. More 
details can be found in Supplementary Information. 


The disturbed and folded ice 

RES data were collected in north Greenland during 2011 and 2012
with a multichannel coherent ice-penetrating radar depth sounder/imager.
The RES images show continuous and undisturbed internal
layers (isochrones) to a depth of about 2,200 m in the NEEM
region, which agrees with the ice-core observations. Below 2,200 m,
internal layers become fuzzy and less continuous: undulations and
even overturned folds and shearing of basal material are observed
(Fig. 3f, g, Supplementary Fig. 2b-e). The transition between clear
and fuzzy layers often appears at the interface between ice from the
glacial and Eemian periods. Very large differences in ice rheological
properties are documented (Fig. 3c, d) between glacial ice (with crystal
sizes of 1.5 mm and a strong preferred vertical c-axis orientation) and
Eemian ice (with crystal sizes of 25 mm and multiple maxima fabrics).
The viscosities of these two types of ice differ by a factor of 50-100 (refs
18-21), allowing glacial ice to deform very easily while the interglacial
ice remains more rigid. The missing, folded and inverted parts of
the ice are from the interface between the relatively rigid Eemian
and the early glacial (118-106 kyr BP), supporting the idea that the
glacial ice deforms over the harder Eemian ice. The broken record at
128.5 kyr BP can be explained by the contrast between the deformation
properties of the underlying ice from the previous glacial period and
those of the Eemian ice. It should be noted that the discontinuous and folded
structures of the GRIP and GISP2 ice cores also occur for ice older than
105 kyr (refs 9, 10). The consistency of the RES images and deep ice-core
results at NEEM is a breakthrough result, and demonstrates that
RES imaging can now be used to predict folded ice layering. This
creates the potential for a systematic reconstruction of the Eemian
Greenland ice-sheet layering from new RES imaging. Assimilation
of such data in ice-sheet models may lead to much improved histories
of the configuration of the ice sheet in the past, improving our ability to
predict its future evolution.


Climate reconstruction from observed records 


The reconstructed Eemian sequence (128.5-114 kyr BP, Fig. 2) allows
for initial climate interpretations of this period. As mentioned above,
the regular occurrence of melt features at 127-118.3 kyr BP is an
indication of warmer temperatures at the depositional surface locations
of the ice than the mean of the recent millennium at NEEM. This
is independently confirmed by the decrease of δ¹⁵N in this zone,
which is indicative of ~5 °C warmer mean annual firn temperatures
at the depositional site (Supplementary Fig. 7). Between 128.5 kyr
BP and 126.0 kyr BP, δ¹⁸O_ice increases from −35‰ to −31.4‰ while
EDML δ¹⁸O_ice values slowly drop from those of the warm early
Antarctic period. This bipolar see-saw behaviour suggests that
inter-hemispheric heat redistribution by the Atlantic meridional
overturning circulation was taking place within the Eemian interglacial
period; this has also been observed during the last Northern
Hemisphere deglaciation 19-11 kyr BP. Before surface melt began,
between 128.5 and 126.7 kyr BP, the air content at the depositional
site had a stable level of 85 ml kg⁻¹ compared to the present level of
97.5 ml kg⁻¹. When corrected for changing local summer insolation
(Supplementary Fig. 10), the air content difference suggests
a surface elevation at the depositional site 540 ± 300 m higher at
the onset of the Eemian (128 kyr BP) than the surface elevation at
NEEM today. The locations of the depositional sites of the
Eemian ice found in the NEEM ice core are modelled using a nested
three-dimensional flow model (Fig. 1a and Supplementary Fig. 1). A
second model used to date the NEEM ice core reaches similar locations
within 20 km. At present, the surface elevation at the depositional
site of the 128-kyr-BP ice (205 ± 20 km upstream from NEEM)
is 330 ± 50 m higher than the present at NEEM. The surface elevation
increase of 210 ± 350 m at the 128-kyr-BP depositional site
(Fig. 4c, blue) is the difference between the elevation at 128 kyr BP
(540 ± 300 m) and the present elevation (330 ± 50 m), both relative
to the present elevation at NEEM. This surface elevation increase is
expected at the onset of a warm climatic period due to increased
precipitation and mass balance changes that occur before the central
part of the ice sheet adjusts to the warmer climate by increasing the ice
flow. This is also established at the onset of the present interglacial
at 11.7 kyr BP.
Figure 4 | Reconstruction of the temperature and elevation history.
Reconstruction of the temperature and elevation history through the Eemian
based on the stable water isotopes (δ¹⁸O_ice) and the air content records. The
zone with surface melt (127-118.3 kyr BP) is shaded in light grey. a, The
measured δ¹⁸O_ice record (black) on the constructed timescale. The average of
the recent millennium (−33.60‰) is marked with a thin black line. It is seen
that the δ¹⁸O_ice values at the depositional locations in the melt zone are above
−33.0‰ (grey horizontal line). The fixed-elevation change of temperature,
constructed from the observed δ¹⁸O_ice, the elevation changes determined from
the air content and the upstream corrections (curves below), is shown as a red
curve using the red axis. The standard error range (orange shading) is a sum of
the error of the δ¹⁸O_ice and the elevation change correction (Supplementary
Information 1.1, equations (2) and (5) in Supplementary Information,
Supplementary Table 2). b, Air content (black) is plotted and guided by the two
stable levels on each side of the melt zone. A dot-dashed line connecting these
levels has been suggested with an error range as the dark grey shaded area. The
standard error range is a sum of the error assumed in the zone with surface melt
(127-118.3 kyr BP) and the 1% error on the air content measurements
(Supplementary Information section 1.4). In addition, the average level 107-
105 kyr BP is marked with a horizontal black bar. The changes in the air content
are caused by pressure changes due to changing surface elevation at the
depositional sites and changes to the air trapping processes in the firn assumed
to be controlled by the changing summer insolation. c, When corrected
for upstream flow (cyan) and summer insolation changes (green), the air
content curve can be ‘translated’ to elevation changes (blue, dashed) with the
shaded zone indicating the uncertainty range introduced by this translation.
Blue bars mark the air content of the ‘translated’ air content black bars. The
standard error range is based on the error range of the air content (dark grey
shaded area) and the additional standard errors from calculation of the
elevation changes (equation (5) in Supplementary Information, and
Supplementary Table 2).


In the period 127-118.3 kyr BP, the air content in the ice where
surface melt occurred was highly variable and cannot directly be used
for ice elevation reconstructions (Fig. 2, shaded zone). We can tentatively
estimate elevation changes through the Eemian climate period
by connecting the two air content levels before and after the melt zone
(Fig. 4b) after correcting for summer insolation, which accounts for
50% of the observed change (Fig. 4c, Supplementary Information).
At 126 kyr BP the surface elevation was 45 ± 350 m higher than at
present. The δ¹⁸O_ice increased to −31.4‰ at 126 kyr BP, exceeding
the current mean value of the recent millennium of −33.6‰ (at the
NEEM site) and the current mean value of −35.0‰ at the depositional
site (Supplementary Information, section 2). Using the
temperature-isotope relation of 2.1 ± 0.5 K ‰⁻¹ (calibrated using
data from the present interglacial), the 3.6‰ anomaly at 126 kyr
BP implies that precipitation-weighted surface temperatures were
7.5 ± 1.8 °C warmer at the depositional site compared to the last
millennium. Note that the modelled location of the depositional site
is the only modelled parameter required to compare the 126-kyr-BP
data to the present-day data at the depositional site. When further
correcting for the more uncertain elevation change of 45 ± 350 m at
the 126-kyr-BP depositional site using a lapse rate of 7.5 ± 0.5 K km⁻¹,
the fixed-elevation temperature increase here is 8 ± 4 °C (Fig. 4a, red).
Our data depict a gradual cooling until 110 kyr BP (Fig. 4a, red curve).
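
The arithmetic behind these temperature estimates can be retraced with a short sketch. It uses only the numbers quoted in the paragraph above; the function names are hypothetical, and combining the uncertainties in quadrature is an assumption made here for illustration. The full error budget is described in the Supplementary Information, and this simplified propagation yields a smaller uncertainty than the ± 4 °C quoted.

```python
import math


def isotope_warming(d18o_then, d18o_now, slope, slope_err):
    """Warming (K) implied by an isotope anomaly (per mil) and a
    temperature-isotope slope (K per per mil), with the slope error only."""
    anomaly = d18o_then - d18o_now
    return slope * anomaly, slope_err * abs(anomaly)


def elevation_correction(elev_km, elev_err_km, lapse, lapse_err):
    """Temperature offset (K) from a surface elevation difference (km) and a
    lapse rate (K per km). Errors are combined in quadrature (an assumption)."""
    corr = lapse * elev_km
    err = math.hypot(lapse * elev_err_km, lapse_err * elev_km)
    return corr, err


# Values quoted in the text for 126 kyr BP at the depositional site.
dt_iso, dt_iso_err = isotope_warming(-31.4, -35.0, slope=2.1, slope_err=0.5)
corr, corr_err = elevation_correction(0.045, 0.350, lapse=7.5, lapse_err=0.5)

# The surface was higher (colder), so the fixed-elevation warming is larger.
dt_fixed = dt_iso + corr
dt_fixed_err = math.hypot(dt_iso_err, corr_err)

if __name__ == "__main__":
    print(f"isotope-based warming: {dt_iso:.2f} +/- {dt_iso_err:.2f} K")
    print(f"fixed-elevation warming: {dt_fixed:.2f} +/- {dt_fixed_err:.2f} K")
```

The isotope-only step reproduces the 3.6‰ anomaly and a warming close to the quoted 7.5 ± 1.8 °C, and adding the lapse-rate correction for the 45 m higher surface brings the central value to about 8 °C.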

The reconstructed precipitation-weighted annual temperature
changes are remarkably high. In general, warmer summer temperatures
are reported from palaeorecords, and a few find temperatures
at 126 kyr BP at high Arctic latitudes as high as those reported
from NEEM. Climate models equipped with water stable isotopes
point to a limited (1 °C) seasonality bias caused by a stronger
enhancement of temperature and precipitation in summer than in
winter. A large spread in temperature has been reported among
simulations of the last interglacial climate, which appear to systematically
underestimate North Atlantic/Arctic warming, possibly due to
missing vegetation and ice-sheet feedback.

Within 6,000 yr, from 128 to 122 kyr BP, the surface elevation is
estimated to have decreased from 210 ± 350 m above to 130 ± 300 m
below the present surface elevation, which translates to a moderate
ice thickness change of 400 ± 350 m after accounting for isostatic
rebound. Based on this estimate, the ice thickness at NEEM decreased
by an average of 7 ± 4 cm per year between 128 and 122 kyr BP and
stayed at this level until 117-114 kyr BP, long after surface melt
stopped and temperatures fell below modern levels.
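
The mean thinning rate follows directly from the numbers in this paragraph. The sketch below is a plain arithmetic check with hypothetical function names. It does not model isostatic rebound: the 400 m thickness change is taken from the text as given, and only central values are computed.

```python
def surface_lowering(elev_start_m, elev_end_m):
    """Drop in surface elevation (m) between two times, both elevations
    expressed relative to the present surface elevation."""
    return elev_start_m - elev_end_m


def mean_rate_cm_per_yr(change_m, duration_yr):
    """Mean rate of change in centimetres per year."""
    return 100.0 * change_m / duration_yr


# Central values quoted in the text.
lowering_m = surface_lowering(210.0, -130.0)  # 128 kyr BP to 122 kyr BP
duration_yr = 128_000 - 122_000
thinning_cm_per_yr = mean_rate_cm_per_yr(400.0, duration_yr)

if __name__ == "__main__":
    print(f"surface lowering: {lowering_m:.0f} m over {duration_yr} yr")
    print(f"mean thinning rate: {thinning_cm_per_yr:.1f} cm per year")
```

The surface lowering of 340 m is smaller than the 400 m thickness change because the latter includes the isostatic rebound of the bedrock, and dividing 400 m by 6,000 yr gives a rate that rounds to the quoted 7 cm per year.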

Even with a minimum ice thickness of only about 10% less than the
present ice thickness at the NEEM site, as reported here, substantial
melting can cause significant reduction of ice thickness near the margins;
this in turn reduces the volume of the Greenland ice sheet.
Although the documentation of ice thickness at one location on the
Greenland ice sheet cannot constrain the overall ice-sheet changes
during the last interglacial period, the NEEM data can only be reconciled
with Greenland ice-sheet simulations that point to a modest
contribution (2 m) to the observed 4-8 m Eemian sea level high
stand. For comparison, no continuing elevation change has so
far been detected in areas with elevations above 2,000 m in north
Greenland during the past few decades. These findings strongly
imply that Antarctica must have contributed significantly to the
Eemian sea level rise.

Despite the complex ice flow, the disturbed record of the deep ice in
the NEEM ice core can be unambiguously reconstructed. The anatomy
of the last interglacial shows that Greenland temperatures
peaked after the onset of the Eemian, 126 kyr BP, with temperatures
(at fixed elevations) 8 ± 4 °C warmer than the average of the recent
millennium and multiple indications of summer melt. Temperatures
gradually decreased during the interglacial, very probably owing to
the strongly decreasing trend in local summer insolation. The surface
elevation first increased due to increased mass balance to 210 ± 350 m
above the present at 128 kyr BP, then decreased to 130 ± 300 m below
the present elevation around 122 kyr BP. Our results provide multiple
new targets to constrain coupled climate/ice-sheet models. Our
record, together with recent observations of rainfall and strong
surface melting in July 2012 at NEEM, shows that conditions are
conducive to the start of melt layer formation at NEEM, with the
2010-12 mean annual surface temperatures 1-2 °C above the 1950-
80 average.

Our results have implications for both ice deformation near the 
bedrock and the response of the Greenland ice sheet to climate 
change. The combination of high-resolution RES data and NEEM 
glacial-interglacial ice layers brings new knowledge of the near-bed
deformation of ice. We believe that the folding and disturbances we 
observe near the bed are strongly related to the rigid deformation 
properties of the interglacial ice. This offers an alternative explanation 
for the large anomalies in RES profiles recently observed under both
the Antarctic and Greenland ice sheets, which were previously attributed
to refrozen basal water.


METHODS SUMMARY 


Stable water isotopes have been measured in the ice (δ¹⁸O_ice). CH₄ and
N₂O concentrations, the isotopes δ¹⁵N of N₂ and δ¹⁸O_atm of O₂,
noble gases and the air content have been measured in the air
extracted from the air bubbles in the ice, all using well-described
methods. These measurements are presented in Supplementary
Information section 1. Models calculating temperatures from water
isotopes, elevation changes from air content, and temperature
changes from δ¹⁵N values have been calibrated using observations
from the present interglacial.
LTP requires a reserve pool of glutamate
receptors independent of subunit type 


Adam J. Granger, Yun Shi, Wei Lu, Manuel Cerpas & Roger A. Nicoll


Long-term potentiation (LTP) of synaptic transmission is thought to be an important cellular mechanism underlying
memory formation. A widely accepted model posits that LTP requires the cytoplasmic carboxyl tail (C-tail) of the AMPA
(α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptor subunit GluA1. To find the minimum necessary
requirement of the GluA1 C-tail for LTP in mouse CA1 hippocampal pyramidal neurons, we used a single-cell molecular
replacement strategy to replace all endogenous AMPA receptors with transfected subunits. In contrast to the prevailing
model, we found no requirement of the GluA1 C-tail for LTP. In fact, replacement with the GluA2 subunit showed normal
LTP, as did an artificially expressed kainate receptor not normally found at these synapses. The only conditions under
which LTP was impaired were those with markedly decreased AMPA receptor surface expression, indicating a
requirement for a reserve pool of receptors. These results demonstrate the synapse’s remarkable flexibility to
potentiate with a variety of glutamate receptor subtypes, requiring a fundamental change in our thinking with regard
to the core molecular events underlying synaptic plasticity.


Information storage in the brain is widely accepted to involve the rapid 
increase in synaptic strength between two neurons that can persist 
over long periods of time. This phenomenon, known as long-term 
potentiation (LTP), has been well described at glutamatergic synapses 
in the hippocampus, a region of the brain that is required for the 
formation of new memories. At these CA1 synapses, LTP is expressed 
by the immediate increase in postsynaptic AMPA-type glutamate 
receptors (AMPARs) after coincident activation of pre- and post- 
synaptic neurons. However, the exact mechanism of rapid AMPAR 
insertion during LTP is not fully understood. 

AMPARs mediate most fast, excitatory synaptic transmission in
the brain. A functional AMPAR is a tetramer of individual subunit
proteins, of which there are four unique isoforms, GluA1, GluA2,
GluA3 and GluA4 (refs 1, 2). In CA1 pyramidal neurons, which serve
as a model for understanding LTP, most receptors exist as GluA1-
GluA2 heteromers, with a minor contribution from GluA2-GluA3
receptors. Over the past decade, a large body of research has focused
on how individual AMPAR subunits are trafficked. A widely held
model posits that GluA1-GluA2 receptors are excluded from synapses
unless an LTP stimulus is provided, whereas GluA2-GluA3 receptors
traffic to the synapse constitutively. This difference in trafficking behaviour
is mediated by the C-tails of the individual subunit proteins.
Supporting this model is the finding that LTP is impaired in GluA1
knockout mice but is normal in GluA2-GluA3 double knockouts. On
the basis of these findings, a broad consensus has emerged that LTP is
mediated by synaptic insertion of GluA1-containing receptors via its
C-tail interactions.

Despite the consensus that GluA1 is required for LTP, no single
phosphorylation site or protein-protein interaction in the GluA1
C-tail has been shown to be absolutely necessary. Our goal was to
find the minimum requirement of the GluA1 C-tail for LTP, and, if
found, use that region to identify crucial protein interactions that
mediate synaptic AMPAR potentiation. To accomplish this, we used
a single-cell molecular replacement strategy to replace all endogenous
AMPARs with transfected subunits. Using this approach, we
systematically mutated the GluA1 C-tail and examined the effects on
three stages of AMPAR trafficking in mice: surface expression, synaptic
transmission and LTP. Surprisingly, we failed to identify any
region in the GluA1 C-tail that was essential either for basal synaptic
incorporation or for LTP. In fact, homomeric GluA2 receptors exhibited
normal LTP. Most surprisingly, hippocampal synapses in which
AMPARs had been replaced with kainate-type glutamate receptors
(KARs) also expressed normal LTP. Only manipulations that severely
compromised the extrasynaptic surface pool of receptors showed
defects in potentiation.


The role of the GluA1 C-tail in surface expression 


AMPAR trafficking can be broken down into three distinct steps:
surface expression, basal synaptic targeting, and activity-dependent
synaptic insertion. Because GluA1 is normally abundantly expressed
on the neuronal surface, we first screened for surface expression
of various GluA1 C-tail truncations in wild-type neurons using
somatic outside-out patches in organotypic slice culture. Because
overexpressed receptors form inwardly rectifying homomers, whereas
endogenous heteromeric receptors show linear current-voltage (I-V)
relationships, we can detect surface expression of the expressed subunits
as an increase in surface rectification (Fig. 1a). Overexpression
of full-length GluA1 by biolistic transfection into CA1 pyramidal neurons
increased the rectification by approximately 40% compared to
wild-type controls (Fig. 1b), indicating the presence of surface homomers.
In contrast, overexpressing a GluA1 subunit with a full C-tail
truncation (ΔC) showed rectification similar to wild-type neurons
(Fig. 1b), indicating an impairment in trafficking to the surface. However,
a less severe truncation up to amino acid 824 (GluA1(Δ824)),
which removes two serine phosphorylation sites and the PDZ-binding
domain, increased rectification to a similar degree as full-length
GluA1. Selective excision of the remaining membrane proximal region
(ΔMPR), which contains a well-characterized binding site of the
protein 4.1N, also significantly increased rectification (Fig. 1b).
Combined, these two modified subunits represent complementary
truncations of the entire C-tail, ruling out a necessary role for any single
part of the C-tail for steady-state surface expression.
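
The rectification readout used in this screen can be illustrated with a minimal sketch. The Figure 1 legend describes rectification as the glutamate-evoked current at +40 mV over that at −70 mV; the exact normalization is not spelled out in this section, so the plain ratio of current magnitudes below is an assumption, as are the function names and the example currents, which are hypothetical and not data from the study.

```python
def rectification_ratio(i_pos40_pa, i_neg70_pa):
    """Ratio of current magnitudes at +40 mV and -70 mV (assumed definition).

    Inward rectification reduces outward current at positive potentials,
    so a more strongly rectifying patch gives a smaller ratio.
    """
    if i_neg70_pa == 0:
        raise ValueError("current at -70 mV must be non-zero")
    return abs(i_pos40_pa) / abs(i_neg70_pa)


def linear_ratio(reversal_mv=0.0):
    """Ratio expected for a perfectly linear I-V relationship with the given
    reversal potential (0 mV is an assumption for illustration)."""
    return abs(40.0 - reversal_mv) / abs(-70.0 - reversal_mv)


# Hypothetical example currents in pA, for illustration only.
control_ratio = rectification_ratio(57.0, -100.0)  # near-linear patch
homomer_ratio = rectification_ratio(20.0, -100.0)  # inwardly rectifying patch

if __name__ == "__main__":
    print(f"linear expectation: {linear_ratio():.3f}")
    print(f"control: {control_ratio:.3f}, homomer-like: {homomer_ratio:.3f}")
```

Under these assumptions a patch dominated by linear heteromers sits near the linear expectation of 40/70, whereas a patch containing inwardly rectifying homomers falls below it, which is the kind of shift the screen relies on to detect surface expression.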

Because competition with endogenous receptors may have hindered
GluA1(ΔC) trafficking, we next studied surface expression in the
absence of native AMPARs. To accomplish this, we used mice in which
the genes coding for GluA1, GluA2 and GluA3 were flanked by loxP
sites (Gria1^fl/fl Gria2^fl/fl Gria3^fl/fl, hereafter referred to as Gria1-3^fl/fl).
A previous study has shown that expression of Cre in Gria1-3^fl/fl
neurons results in a complete absence of AMPARs within 12-15 days,
providing an effective AMPAR-null background onto which mutant
GluA1 subunits can be expressed. We confirmed that Cre expression
eliminated all glutamate-evoked current from somatic outside-out
patches of Gria1-3^fl/fl CA1 neurons, which can be rescued to control
amplitudes by co-expression with full-length GluA1 (Fig. 1c, d), indicating
full rescue of surface expression. Consistent with overexpression,
molecular replacement with GluA1(ΔC) showed significantly decreased
glutamate-evoked currents (Fig. 1c, d). This trafficking defect was not due
to decreased association with TARPs (that is, auxiliary subunits important
for AMPAR trafficking), as both full-length and GluA1(ΔC)
subunits had KA/Glu ratios similar to control (Supplementary Fig. 1a).
Also, both GluA1 and GluA1(ΔC) replacement subunits showed
strong inward rectification, confirming the absence of endogenous
receptors (Supplementary Fig. 1b). Because both GluA1(ΔMPR) and
GluA1(Δ824) showed normal surface trafficking, the GluA1(ΔC) subunit
may be impaired owing to its severe truncation so close to the
transmembrane region, which may inhibit proper protein folding.


Synaptic transmission doesn’t require the GluA1 C-tail 


Given the decreased surface expression caused by complete trun- 
cation of the GluA1 C-tail, we next examined whether it would also 
Figure 1 | The role of the GluA1 C-tail in surface trafficking. a, Experimental
protocol and example trace showing voltage ramps applied to outside-out
patches of control (black) and GluA1-overexpressing (green) CA1 neurons.
Rectification was measured as the normalized glutamate-evoked current at +40
mV over −70 mV. b, Full-length GluA1, GluA1(Δ824) and GluA1(ΔMPR)
significantly increased rectification of surface currents compared to control.
Overexpression of GluA1(ΔC) slightly increased rectification (control, n = 47;
GluA1, n = 10, P < 0.001; GluA1(Δ824), n = 13, P < 0.001; GluA1(ΔMPR),
n = 18, P < 0.001; GluA1(ΔC), n = 8, P < 0.05). c, Cre expression eliminates
glutamate-evoked currents in Gria1-3^fl/fl CA1 neuron outside-out patches,
which is rescued to control levels by co-expression with full-length GluA1, but
not GluA1(ΔC) (control, n = 28; Cre, n = 9, P < 0.001; GluA1, n = 11, P > 0.05;
GluA1(ΔC), n = 15, P < 0.001). d, Example traces of glutamate-evoked current
from (left to right) Gria1-3^fl/fl control neurons, Cre-expressing neurons, GluA1,
and GluA1(ΔC) replacement neurons. Scale bars: 1 s, 100 pA. Error bars
represent mean ± s.e.m. *P < 0.05; ***P < 0.001 in b, c.
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impair basal synaptic targeting. Similar to surface currents, we 
assessed baseline synaptic transmission by transfecting GriaI-3"/" 
organotypic slice cultures with Cre and a replacement GluA1 subunit. 
After 17 days, we recorded evoked AMPAR excitatory postsynaptic 
currents (EPSCs) simultaneously from control and neighbouring 
GluAl-replacement CA1 neurons. Similar to previously described 
results”, full-length GluAl rescued AMPAR EPSC amplitudes to 
~68% of control cells, while leaving NMDAR EPSCs unchanged 
(Fig. 2a, c). However, these results contrast with previous studies 
showing that GluA1 only traffics to synapses after an LTP stimulus”’. 
For experimental exploration of this discrepancy, please refer to 
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Figure 2 | GluA1(AC) has normal synaptic targeting. Paired whole-cell 
recordings from control and Cre plus GluA1 or Cre plus GluA1(AC)-expressing 
CAI neurons in Gria1-3"" organotypic slice cultures. a, c, Full-length GluA1 
rescued synaptic AMPAR EPSCs to 68% of control cells (n = 13, P> 0.05), 
whereas NMDA EPSCs remained unchanged between control and transfected 
cells (P > 0.05). b, d, Replacement with GluA1(AC) results in 73% rescue of 
AMPA EPSCs without a change in the NMDA EPSC (n = 15, both P > 0.05). 
e, Replacement with GluA1 showed inwardly rectifying EPSCs (n = 8, P< 0.01). 
f, Summary graph of AMPA and NMDA EPSC rescue between GluA1 and 
GluA1(AC). Example traces show average EPSCs for paired control (black) and 
replacement (green) neurons. Scale bars: 20 ms (AMPA), 100 ms (NMDA), 

50 pA. Error bars represent mean + s.e.m. **P < 0.01 ine. 
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Supplementary Fig. 2. We also observed no change in paired-pulse 
ratio, indicating that GluAl molecular replacement did not affect 
presynaptic release probability (Supplementary Fig. 3a). Synaptic 
EPSCs from GluA 1-replacement CA1 neurons were strongly inwardly 
rectifying compared to control, confirming the absence of endogen- 
ous receptors (Fig. 2e and Supplementary Fig. 3b). Replacement 
with GluAl(AC) rescued AMPAR EPSCs to the same degree as 
full-length GluA1, and also had no effect on the NMDA EPSCs 
(Fig. 2b, d). Replacement with GluA1(A824) produced similar results 
(Supplementary Fig. 3c-e). This demonstrates that despite having 
markedly decreased somatic expression owing to its severe truncation, 
GluA1(AC) manages to rescue basal synaptic transmission effectively. 


GluA1 C-tail domains are not required for LTP 

To assess how GluA1 C-tail truncations affect LTP, we transfected Cre 
and GluA1 into the hippocampus of Gria1-3fl/fl mouse embryos at 
approximately embryonic day 15.5 (~E15.5) by electroporation. Like 
biolistic transfection, this yields a sparse population of transfected 
cells. Electroporation 
of Cre alone resulted in complete absence of an AMPAR EPSC by 
postnatal day 10 (P10) with no effect on NMDAR EPSCs, and no 
AMPAR EPSCs appeared after an LTP stimulus (Supplementary 
Fig. 4a-c). In P17-20 acute hippocampal slices, we induced LTP 
after recording stable (3-5 min) baseline AMPAR EPSCs 
simultaneously from control and GluA1-replacement neurons. We found that 
neurons with full-length GluA1 replacement exhibited normal LTP (Fig. 3a), 
confirming that the GluA1 subunit is sufficient. To avoid the 
confounding effect of decreased surface expression seen with GluA1(ΔC), 
we next assessed the competence of GluA1(Δ824) and GluA1(ΔMPR) 
subunits, which represent overlapping truncations of the entire C-tail. 
Both truncated subunits expressed LTP comparable to control 
(Fig. 3b, c), as did neurons replaced with a truncated GluA1(Δ824) 
subunit with S816A and S818A mutations (GluA1(Δ824-AA)) to 
specifically prevent 4.1N binding” (Supplementary Fig. 5a, b). We 
also found that expressing GluA1(ΔC) with GluA2 to produce more 
natural GluA1-GluA2 heteromers rescued the surface trafficking 
defect of GluA1(ΔC) (Supplementary Fig. 6a-d), and produced 
synaptic responses similar to controls (Supplementary Fig. 6e-h). 
Finally, LTP was fully rescued by replacement with GluA1(ΔC)- 
GluA2 (Fig. 3d). Combined, these data show that the GluA1 C-tail 
is not required for LTP. 
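
As a hedged illustration of how LTP magnitude of this kind is commonly quantified, the sketch below normalises EPSC amplitudes to the mean pre-induction baseline and reads out the mean normalised amplitude around minute 45. It is not the analysis code used in the study. The function name, the 1-min averaging window and the example numbers are assumptions made for illustration only.

```python
from statistics import mean


def potentiation_at(times_min, amplitudes_pa, readout_min=45.0, window_min=1.0):
    """Mean EPSC amplitude near readout_min, divided by the mean baseline amplitude.

    times_min are relative to LTP induction (negative values are baseline).
    The readout window is an illustrative assumption, not a value from the paper.
    """
    baseline = [a for t, a in zip(times_min, amplitudes_pa) if t < 0]
    readout = [a for t, a in zip(times_min, amplitudes_pa)
               if abs(t - readout_min) <= window_min]
    if not baseline or not readout:
        raise ValueError("need samples in both the baseline and readout windows")
    return mean(readout) / mean(baseline)


# Invented example: about -50 pA at baseline, about -100 pA after induction.
times = [-3, -2, -1, 44, 45, 46]
amps = [-48.0, -50.0, -52.0, -98.0, -100.0, -102.0]
ratio = potentiation_at(times, amps)  # 2.0, that is, a doubling of the EPSC
```

Applying the same function to a control cell and a replacement cell recorded simultaneously gives the paired values that are compared at minute 45.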


GluA2 is sufficient for LTP 


Given that no individual portion of the GluA1 C-tail was necessary 
for LTP, we hypothesized that expression of an alternative AMPAR 
subunit might also rescue LTP. GluA2 is another such subunit with 
limited C-tail homology to GluA1 (ref. 15) that is normally highly 
expressed in CA1 neurons, but is ineffective at forming homomers 
and trafficking to the cell surface*”®. This is attributable to Q/R RNA 
editing in the pore of the receptor, which severely limits channel 
permeability and may make formation of homomers energetically 
unfavourable”. Expression of unedited GluA2 with an R586Q 
mutation (GluA2(Q)) resulted in abundant appearance of homomers on 
the neuronal surface, as observed by increased rectification (Fig. 4a). 
Like GluA1(ΔC), GluA2((Q)ΔC) also showed impaired surface 
expression (Supplementary Fig. 7d). Similarly, both full-length 
GluA2(Q) (Fig. 4b) and GluA2((Q)ΔC) (Supplementary Fig. 7b) 
trafficked to the synapse, arguing against any necessary role for the GluA2 
C-tail in synaptic targeting, in agreement with previous 
experiments’?. NMDA EPSCs and paired-pulse ratio remained unchanged 
in these replacement neurons, and complete replacement of endogenous 
receptors was confirmed by inward rectification (Supplementary 
Fig. 7a, b). Moreover, LTP in Gria1-3fl/fl neurons that expressed only 
GluA2(Q) was indistinguishable from control cells (Fig. 4c), despite 
lacking any of the intracellular phosphorylation sites and 
protein-protein binding sites of GluA1. Similarly intact synaptic targeting and 
LTP were seen in a GluA2(Q) truncation that lacks most of its C-tail 
and known protein-interaction sites (Supplementary Fig. 7c-e). 


LTP requires a reserve pool of AMPARs 

Previous studies have shown that LTP is impaired in mice with 
constitutive deletion of GluA1 (ref. 8), but not GluA2 or GluA3 (ref. 9), 
demonstrating that GluA1 is both necessary and sufficient for LTP. 
These findings seem to contradict our data showing that GluA2(Q) 
homomers readily express LTP. We therefore re-examined the 
requirement for GluA1 in single-cell conditional knockouts and 
found that conditional deletion of GluA1 alone did indeed impair LTP 
(Fig. 5a). Furthermore, deletion of GluA2 or GluA3 separately 
(Supplementary Fig. 8a, b) or in combination (Fig. 5b) had no effect. How 
can these data be reconciled with our previous experiments? One 
profound difference between deleting GluA1 and deleting GluA2 
and/or GluA3 is that in the former condition there is an absence of 
extrasynaptic receptors**”°, whereas in the latter condition this pool 
remains entirely intact*. Also, unlike endogenous GluA2, our 
replacement GluA2(Q) showed abundant surface expression. We reasoned 
that perhaps it is the depletion of this pool that accounts for the loss of 
LTP in the GluA1 knockout. To test this possibility, we again used the 
extreme C-tail truncations of both GluA1 and GluA2(Q), in which 
surface expression is impaired but synaptic targeting is maintained 
(Fig. 1b, c and Supplementary Fig. 7d). Indeed, LTP was substantially 
impaired in both GluA1(ΔC) and GluA2((Q)ΔC) replacement 
neurons (Fig. 5c, d). These findings suggest that the minimum 
requirement for LTP is a reserve pool of extrasynaptic AMPARs, regardless of 
the subunit type. 

Figure 3 | LTP requires no single portion of the GluA1 C-tail. Paired 
whole-cell recordings from control CA1 neurons and neighbouring Cre plus 
GluA1-expressing neurons in P17-20 Gria1-3fl/fl acute slices. a-d, LTP is similar to 
control in GluA1 (a), GluA1(Δ824) (b), GluA1(ΔMPR) (c) and GluA1(ΔC) plus 
GluA2 (d) replacement neurons (GluA1, n = 11; GluA1(Δ824), n = 11; 
GluA1(ΔMPR), n = 20; GluA1(ΔC) plus GluA2, n = 11; all P > 0.05). Example 
traces show EPSCs before and 45 min after LTP induction in paired control 
(black) and GluA1-replacement neurons (green). Scale bars: 20 ms, 100 pA. 
Error bars represent mean ± s.e.m. 


GluK1 is sufficient for mediating LTP 


Having failed to identify any specific domains in the C-tails that are 
important for LTP, we wondered whether other domains in the 
AMPAR are required. In search of a null condition to conduct 
domain-swapping experiments, we turned to KARs, a separate class 
of fast, ionotropic glutamate receptor which differs in fundamental 
ways from AMPARs. KARs bind to different auxiliary subunits and 
have no sequence homology in their C-tails”*. We therefore set out to 
replace all endogenous AMPARs with KARs at CA1 synapses. CA1 
pyramidal neurons do not express synaptic KARs, as shown by the 
absence of synaptic currents in the presence of the AMPAR-selective 
antagonist GYKI (4-(8-methyl-9H-1,3-dioxol[4,5-h][2,3]benzodiazepine-5-yl)-benzamine 
dihydrochloride) (Supplementary Fig. 9a). 
However, co-expression of the KAR subunit GluK1 with the auxiliary 
subunit Neto in wild-type CA1 neurons generated a GYKI-resistant 
current that was blocked by NBQX 
(2,3-dioxo-6-nitro-1,2,3,4-tetrahydrobenzo(f)quinoxaline-7-sulfonamide), an antagonist that 
blocks both KARs and AMPARs (Supplementary Fig. 9a). This indicates 
that overexpressed KARs are capable of being targeted to the synapse and 
contribute to EPSCs. To examine KAR currents in isolation, we 
co-expressed Cre with GluK1 and Neto2 in Gria1-3fl/fl CA1 neurons. In this 
case, we recorded a population of pure KARs on the surface that 
desensitize to glutamate even in the presence of cyclothiazide and are 
completely blocked by ACET, a highly specific GluK1 antagonist” (Fig. 6a). 
Furthermore, in this AMPAR-null background, these neurons exhibit 
EPSCs that are entirely blocked by ACET, whereas the EPSCs in 
neighbouring control neurons are unaffected (Fig. 6b), further 
demonstrating that exogenously expressed KARs are capable of being targeted to 
synapses. As with AMPAR replacement, NMDA EPSCs were unaffected 
(Supplementary Fig. 9b). Finally, we tested whether neurons expressing 
only KARs could express LTP. Unexpectedly, we found that the KAR 
EPSCs showed potentiation indistinguishable from that recorded 
simultaneously from neighbouring control neurons (Fig. 6c). To ensure that 
the EPSC in the KAR-expressing neuron was mediated entirely by KARs, 
we applied ACET at the end of the experiments and found that it 
abolished the EPSC, but had no effect on neighbouring control neurons 
(Fig. 6c). We also wanted to confirm that LTP mediated by KARs, 
which are also Ca2+-permeable, was not induced by a fundamentally 
different mechanism than wild-type LTP. We therefore tried inducing 
LTP in the presence of the NMDAR antagonist AP5 
(D(-)-2-amino-5-phosphonovaleric acid), and saw no significant potentiation 
(Supplementary Fig. 9d, e). These experiments demonstrate that even neurons 
completely lacking AMPARs can undergo LTP, as long as they are 
provided with an alternative fast, ionotropic glutamate receptor. 

Figure 4 | GluA2(Q) is sufficient to express LTP. a, Overexpression of 
GluA2(Q) caused significantly increased surface rectification compared to 
control (control, n = 8; GluA2(Q), n = 14, P < 0.001). b, Paired whole-cell 
recordings between control and Cre plus GluA2(Q)-expressing Gria1-3fl/fl 
CA1 neurons show rescue of AMPA EPSCs (GluA2(Q), n = 16, P < 0.05). 
Average AMPA EPSC example traces are shown for paired control (black) and 
GluA2-replacement neurons (green). c, Expression of Cre plus GluA2(Q) 
shows LTP similar to control (n = 14, P > 0.05, minute 45). Example traces 
show average AMPA EPSCs before and 45 min after LTP induction. Scale bars: 
20 ms and 50 pA. Error bars represent mean ± s.e.m. ***P < 0.001 in a. 

Figure 5 | Lack of surface expression corresponds with loss of LTP in GluA1 
conditional knockouts, and GluA1(ΔC) and GluA2((Q)ΔC) replacement 
neurons. a, Conditional GluA1 knockout cells (Gria1fl/fl plus Cre) demonstrate 
impaired LTP compared to control (n = 13, P < 0.001, 45 min). b, GluA2 and 
GluA3 knockout cells (Gria2fl/fl Gria3fl/fl plus Cre) demonstrate comparable 
LTP to control (n = 6, P > 0.05, 45 min). c, d, Molecular replacement with 
either GluA1(ΔC) or GluA2((Q)ΔC) results in reduced expression of LTP 
(GluA1(ΔC), n = 16, P < 0.05; GluA2((Q)ΔC), n = 10, P < 0.05, both at 
45 min). Example traces show averaged AMPA EPSCs before and 45 min after 
induction of LTP in paired experimental neurons (green) and control cells 
(black). Scale bars: 20 ms, 50 pA. Error bars represent mean ± s.e.m. 


Discussion 


Using a single-cell molecular replacement approach that gave us com- 
plete control over the complement of expressed AMPA receptors, we 
found no requirement for the GluA1 C-tail for basal synaptic trans- 
mission or for LTP. In fact, we found no requirement for the GluA1 
subunit generally, as both GluA2(Q), another AMPAR subunit, and 
GluK1, an entirely separate class of glutamate receptor, exhibited 
normal levels of LTP. Previous studies that have implicated the 
GluA1 C-tail in LTP demonstrated phenotypes with a largely normal 
initial stage of potentiation, followed by a gradual decrease in EPSC 
amplitude towards baseline******. The most compelling of these stud- 
ies demonstrates impaired LTP in mice with phospho-null knock-in 
mutations of two key phosphorylation sites, and complete absence of 
long-term depression (LTD)**. Another study with phospho-mimetic 
knock-in mutations also demonstrated a decreased threshold for LTP 
induction**. Given these findings, we cannot rule out the possibility 
that the C-tails have some modulatory effect on synaptic plasticity. 
In the present experiments, however, we saw immediately impaired 
potentiation in GluA1 conditional knockout cells and cells with 
GluA1(ΔC) and GluA2(ΔC) replacement, which more closely mimics 
the absence of LTP seen with pharmacological blockade of NMDA 
receptors. With all three of these manipulations, there was a profound 
decrease in the pool of extrasynaptic receptors, indicating that the 
main requirement for LTP is an adequate reserve pool of gluta- 
mate receptors. Another equally plausible model is that AMPAR- 
containing recycling endosomes are required for LTP, and conditions 
that deplete the surface receptor pool also deplete this pool** (see 
Supplementary Discussion). 

Fundamentally, our results suggest that synapses can accumulate a 
broad variety of receptors after LTP, shifting the focus of LTP expres- 
sion from the receptor subunits to the synapse itself and specifically 
the postsynaptic density (PSD). Our data suggest a model in which 
AMPARs freely diffuse on the neuronal surface, and are trapped at the 
PSD for use in synaptic transmission*’. LTP, then, can be understood 
as an immediate increase in the ability of the PSD to trap receptors 
that relies on a reserve pool of freely diffusing surface receptors. This 
model is consistent with evidence from two-photon glutamate 
uncaging experiments, which show an immediate increase in the volume of 
postsynaptic spines after LTP induction*®**, suggesting significant 
alterations to the synapse and PSD. Despite this shift of focus, research 
on AMPARs and their auxiliary subunits, such as TARPs, remains 
important for identifying LTP-related PSD proteins. In the absence 
of a role for the GluA1 C-tail, the question remains exactly which 
specific interactions cluster AMPARs at the synapse both basally and 
during plasticity. Identification of these interactions may be crucial to 
understanding the synaptic modifications that underlie learning in 
the brain. 

Figure 6 | GluK1 expresses on the neuronal surface, targets to synapses and 
mediates LTP. a, Co-expression of Cre, GluK1 and Neto2 in Gria1-3fl/fl 
neurons results in robust glutamate-evoked currents from somatic outside-out 
patches (n = 10). The current desensitizes in the presence of 100 µM 
cyclothiazide (CTZ), and is completely blocked by 1 µM ACET. b, Paired 
recordings from Cre/GluK1/Neto2-expressing and neighbouring control CA1 
neurons resulted in a 33% rescue of synaptic EPSCs (n = 20, P < 0.001). 
Example trace (inset) shows paired control (black) and GluK1-replacement 
(green) EPSCs. 1 µM ACET completely blocks the GluK1 replacement EPSCs 
(green example traces, upper middle), with no block of control cell EPSCs 
(black example traces, lower middle) (n = 14, P < 0.001). c, Paired whole-cell 
recording from control and Cre/GluK1/Neto2-expressing Gria1-3fl/fl CA1 
neurons shows similar levels of LTP (n = 12, P > 0.05, minute 45). 1 µM ACET 
completely blocks the GluK1-replacement EPSC, but not control (n = 11, 
P < 0.001, minute 60). Example traces show average EPSCs before and 45 min 
after LTP induction in control (black) and GluK1-replacement neurons 
(green). Scale bars: 1 s (a), 20 ms (b, c) and 50 pA (a-c). Error bars represent 
mean ± s.e.m. ***P < 0.001 in b. 


METHODS SUMMARY 

Electrophysiology and neuronal transfection. Whole-cell and outside-out patch 
recordings were performed as previously described’. Slice cultures were prepared 
on P6-8 as previously described*? and recorded on day in vitro (DIV) 9-24 
depending on the experiment. Acute slices for LTP experiments were prepared 
between P17-23. All slices were maintained during recording in artificial 
cerebrospinal fluid (aCSF) containing (in mM): 119 NaCl, 2.5 KCl, 1 NaH2PO4, 26.2 
NaHCO3 and 11 glucose. For acute slices, 2.5 mM CaCl2 and 1.3 mM MgSO4 were 
added to the aCSF, and 4 mM CaCl2 and MgSO4 were added for organotypic slice 
cultures. The internal whole-cell recording solution contained (in mM): 135 
CsMeSO4, 8 NaCl, 10 HEPES, 0.3 EGTA, 5 QX-314, 4 Mg-ATP, 0.3 Na-GTP 
and 0.1 spermine. Osmolarity was adjusted to 290-295 mOsm, and pH buffered 
at 7.3-7.4. Synaptic responses were evoked by stimulating with a monopolar glass 
electrode filled with aCSF in stratum radiatum of CA1. Biolistic transfections and 
~E15.5 electroporations were carried out as previously described** ’. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Mouse genetics. Animals were housed according to the IACUC guidelines at the 
University of California, San Francisco. Gria1-3fl/fl mice were generated and 
genotyped as previously described’. 

Experimental constructs. Flip-isoform GluA1, GluA2(Q) and Cre:mCherry 
were cloned into the pFUGW expression plasmid by PCR and In-Fusion HD 
Cloning System (Invitrogen). pFUGW-GluA1 and GluA2(Q) co-expressed with 
GFP behind an internal ribosomal entry site (IRES). GluA1 and GluA2(Q) 
truncations were generated by overlapping extension PCR. GluA1(ΔC) ended in 
amino acid 812, with the last four amino acids being EFCY. GluA1(Δ824) ended 
in amino acid 824, with the sequence MKGF. GluA1(Δ824-AA) contained the 
C-tail sequence EFCYKSRAEAKRMKGF. GluA1(ΔMPR) had the following 
amino acids excised from the C-tail: KSRSESKRMKGFC, with the rest of the 
C-tail intact. GluA2((Q)ΔC) was also truncated to end in amino acids EFCY, and 
GluA2((Q)Δ847) ended in amino acids MKGF. GluK1 and Neto2 were cloned 
into the pCAGGS expression plasmid with GFP and mCherry, respectively, 
co-expressed behind an IRES. 

Neuronal transfection. Sparse biolistic transfections of organotypic slice cultures 
were performed as previously described*””. Briefly, 80 µg total of mixed plasmid 
DNA was coated on 1 µm-diameter gold particles in 0.5 mM spermidine, 
precipitated with 0.1 mM CaCl2 and washed four times in pure ethanol. The gold 
particles were coated onto PVC tubing, dried using ultra-pure N2 gas, and stored 
at 4 °C in desiccant. DNA-coated gold particles were delivered with a Helios Gene 
Gun (BioRad). Cre expression was confirmed by mCherry epifluorescence, and 
replacement AMPAR/KAR subunits confirmed by GFP epifluorescence. 

For in utero electroporations, ~E15.5 pregnant Gria1-3fl/fl mice were 
anaesthetized with 2.5% isoflurane in O2 and injected with buprenorphine for 
analgesia. Embryos within the uterus were temporarily removed from the abdomen 
and injected with 2 µl of mixed plasmid DNA into the left ventricle via a bevelled 
micropipette. pFUGW-Cre:mCherry was typically diluted to approximately 
0.5 µg µl⁻¹ in 2-3 µg µl⁻¹ of the replacement pFUGW AMPAR or pCAGGS 
GluK1 plasmid. Each embryo was electroporated with 5 × 50 ms, 35 V pulses. 
The positive electrode was placed in the lower right hemisphere and the 
negative electrode placed in the upper left hemisphere’. After electroporation, the 
embryos were sutured into the abdomen, and killed on P17-20 for LTP recording. 
For further detail on electroporation, please see ref. 42. 

Electrophysiology. Voltage-clamp recordings were taken from CA1 pyramidal 
neurons in either acute hippocampal slices or organotypic slice cultures. For acute 
slices, 300 µm transverse slices were cut using a Microslicer DTK-Zero1 (Ted 
Pella) in chilled high sucrose cutting solution containing (in mM): 2.5 KCl, 7 
MgSO4, 1.25 NaH2PO4, 25 NaHCO3, 7 glucose, 210 sucrose, 1.3 ascorbic acid, 3 
sodium pyruvate. The slices were then incubated for 30 min at 34 °C in artificial 
cerebrospinal fluid (aCSF) containing (in mM): 119 NaCl, 2.5 KCl, 1 NaH2PO4, 
26.2 NaHCO3 and 11 glucose. For acute slices, 2.5 mM CaCl2 and 1.3 mM MgSO4 
were added to the aCSF, and for organotypic slice cultures 4 mM CaCl2 and 
MgSO4 were added. The aCSF was bubbled with 95% O2 and 5% CO2 to maintain 
pH, and the acute slices allowed to recover at room temperature for 45 min to 1 h. 
Cultured slices were prepared as previously described*’, and recorded between 
7-24 days in vitro (DIV) depending on the experiment. During recording, slices 
were transferred to a perfusion stage on an Olympus BX51WI upright microscope 
and perfused at 2.5 ml min⁻¹ with aCSF containing 0.1 mM picrotoxin for acute 
slice experiments, and 0.01 mM gabazine and 2-5 µM 2-Cl adenosine for 
organotypic slice cultures. Synaptic responses were evoked by stimulating with 
a monopolar glass electrode filled with aCSF in stratum radiatum of CA1. To 
ensure stable recording, membrane holding current, input resistance, and pipette 
series resistance were monitored throughout recording. Data were gathered 
through a MultiClamp 700B amplifier (Axon Instruments), filtered at 2 kHz, 
and digitized at 10 kHz. 

Whole-cell synaptic recordings and LTP. Simultaneous dual whole-cell record- 
ings were made between GFP and/or mCherry positive experimental cells as iden- 
tified by epifluorescence, and neighbouring non-transfected control cells. Internal 
recording solution contained (in mM): 135 CsMeSO4, 8 NaCl, 10 HEPES, 0.3 EGTA, 
5 QX-314, 4 Mg-ATP, 0.3 Na-GTP, and 0.1 spermine. Osmolarity was adjusted to 
290-295 mOsm, and pH buffered at 7.3-7.4. AMPAR- and KAR-mediated 
responses were isolated by voltage-clamping the cell at −70 mV, whereas NMDA 
responses were recorded at +40 mV, with amplitudes taken 100 ms after stimulation 
to avoid contamination by AMPA receptor current. Paired-pulse ratios of AMPAR 
EPSCs were taken by stimulating twice at a 40-ms interval. To examine AMPA 
receptor rectification, 0.1 mM AP5 (D(-)-2-amino-5-phosphonovaleric acid) was 
washed in to block NMDA receptors. LTP was induced by stimulating at 2 Hz for 
90 s while clamping the cell at 0 mV, after recording a stable 3-5 min baseline, but 
not more than 6 min after breaking into the cell. To minimize run-up of baseline 
responses during LTP, slices were stimulated for ~10 min before breaking in, and 
both cells held cell-attached for 2-5 min before breaking into the whole cell. Before 
breaking in, stimulation intensity was calibrated just below the threshold required to 
elicit an action potential from the wild-type control neuron. Rectification was 
calculated as the ratio of the slopes of the lines connecting AMPA EPSC amplitude from 
0 to +40 mV and from −70 mV to 0 mV. This calculation can be taken as follows: 
$\mathrm{RI} = 7(I_{40} - I_{0}) / [4(I_{0} - I_{-70})]$, where $I_{x}$ represents the EPSC amplitude at x mV. 
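
The rectification index defined above can be written as a short function. This is a minimal sketch, not the authors' analysis code: the function and argument names are hypothetical and the example amplitudes are invented. The factor 7/4 in the formula arises because the two slopes are taken over 40 mV and 70 mV respectively.

```python
def rectification_index(i_neg70, i_0, i_pos40):
    """Slope of the I-V line from 0 to +40 mV divided by the slope from -70 to 0 mV.

    Arguments are EPSC amplitudes (for example in pA) at -70, 0 and +40 mV.
    Algebraically this equals 7 * (I40 - I0) / (4 * (I0 - I-70)).
    """
    slope_pos = (i_pos40 - i_0) / 40.0   # current change per mV, 0 to +40 mV
    slope_neg = (i_0 - i_neg70) / 70.0   # current change per mV, -70 to 0 mV
    if slope_neg == 0:
        raise ValueError("no current change between -70 and 0 mV")
    return slope_pos / slope_neg


# Invented amplitudes: a linear I-V relation and an inwardly rectifying one.
linear = rectification_index(-70.0, 0.0, 40.0)      # 1.0
rectifying = rectification_index(-70.0, 0.0, 10.0)  # 0.25
```

A value near 1 corresponds to a linear current-voltage relation, whereas values well below 1 indicate inward rectification.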

Outside-out patches. Outside-out patches were taken from CA1 cells by obtaining 
whole-cell access to CA1 pyramidal neurons at −70 mV with a 4-5 MΩ patch pipette, 
then slowly pulling the pipette away from the soma until a high-resistance seal 
reformed. HEPES-aCSF containing (in mM) 150 NaCl, 2.5 KCl, 10 HEPES, 10 
glucose, 1 MgCl2, 2 CaCl2, 0.1 AP5, 0.1 picrotoxin, 0.1 cyclothiazide and 0.5 µM 
TTX was then perfused over the tip of the pipette. Glutamate and kainate currents 
were evoked by perfusion of HEPES-aCSF containing 1 mM L-glutamic acid and 
1 mM kainic acid, respectively. A ValveLink 8 (AutoMate Scientific Inc.) was used 
for fast perfusion of control, glutamate and kainate containing HEPES-aCSF. During 
outside-out patch experiments, experimental cells were interleaved with 
non-transfected control cells. Rectification was calculated as in synaptic experiments. 

Statistics. For all experiments involving unpaired data, including all outside-out 
patch data, a Mann-Whitney U-test with Bonferroni correction for multiple 
comparisons was used. For all experiments using paired whole-cell data, includ- 
ing all synaptic replacement and synaptic overexpression, a two-tailed Wilcoxon 
signed-rank test was used. LTP data were gathered as pairs of control and experi- 
mental neurons, but occasionally during experiments one of the cells would be 
lost. Comparisons were therefore made using the Mann-Whitney U-test, and the 
reported n values represent the number of cells at the end of each experiment. 
Data analysis was carried out in Igor Pro (Wavemetrics), Excel (Microsoft), and R 
(The R Project for Statistical Computing, http://www.r-project.org/). 
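
The Bonferroni step mentioned above can be illustrated with a short sketch. It assumes the common form of the correction, in which each P value is multiplied by the number of comparisons and capped at 1; the text does not state which variant was applied. The function name and the example P values are invented for illustration and are not data from the study.

```python
def bonferroni(p_values, alpha=0.05):
    """Return Bonferroni-adjusted P values and significance flags.

    Each raw P value is multiplied by the number of comparisons (capped at 1.0)
    and then compared with alpha.
    """
    m = len(p_values)
    adjusted = [min(1.0, p * m) for p in p_values]
    flags = [p_adj < alpha for p_adj in adjusted]
    return adjusted, flags


# Invented raw P values from four hypothetical comparisons against one control.
raw = [0.0002, 0.012, 0.03, 0.2]
adjusted, flags = bonferroni(raw)
# adjusted is approximately [0.0008, 0.048, 0.12, 0.8]
# flags is [True, True, False, False]
```

The raw P values themselves would come from the rank tests named in the text. In Python these are available as `scipy.stats.mannwhitneyu` and `scipy.stats.wilcoxon`, although the authors report using Igor Pro, Excel and R.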


Energy release in the solar corona from spatially resolved magnetic braids 


J. W. Cirtain, L. Golub, A. R. Winebarger, B. De Pontieu, K. Kobayashi, R. L. Moore, R. W. Walsh, K. E. Korreck, M. Weber, 
P. McCauley, A. Title, S. Kuzin & C. E. DeForest 


It is now apparent that there are at least two heating mechanisms in 
the Sun’s outer atmosphere, or corona’*. Wave heating may be the 
prevalent mechanism in quiet solar periods and may contribute to 
heating the corona to 1,500,000 K (refs 1-3). The active corona 
needs additional heating to reach 2,000,000-4,000,000 K; this heat 
has been theoretically proposed® ” to come from the reconnection 
and unravelling of magnetic ‘braids’. Evidence favouring that pro- 
cess has been inferred’*"*, but has not been generally accepted 
because observations are sparse and, in general, the braided mag- 
netic strands that are thought’*'*"” to have an angular width of 
about 0.2 arc seconds have not been resolved’®'**°. Fine-scale 
braiding has been seen”’” in the chromosphere but not, until now, 
in the corona. Here we report observations, at a resolution of 0.2 arc 
seconds, of magnetic braids in a coronal active region that are recon- 
necting, relaxing and dissipating sufficient energy to heat the struc- 
tures to about 4,000,000 K. Although our 5-minute observations 
cannot unambiguously identify the field reconnection and sub- 
sequent relaxation as the dominant heating mechanism throughout 
active regions, the energy available from the observed field relaxation 
in our example is ample for the observed heating. 

Observations of the solar atmosphere in the visible and infrared 
spectral ranges have been made from ground-based instruments on 
~0.2-arcsec spatial scales, in particular in Hα emission, and these show 
coherent structures 0.2 arcsec in diameter”’. But for the ultraviolet and 
X-ray emission corresponding to temperatures in excess of 100,000 K, 
there is a paucity of data. Detecting these wavelengths requires the use 
of instruments outside Earth’s atmosphere, and observations so far, 
including those of the NASA Atmospheric Imaging Assembly” (AIA) 
instrument at present operating on the Solar Dynamics Observatory 
(SDO), have been limited to a resolution of about 1.0 arcsec. The 
High-resolution Coronal Imager (Hi-C; see Supplementary Fig. 1) was 
launched on a sounding rocket on 11 July 2012 and took images of 
the 1,500,000-kelvin (1.5-MK) corona with a resolution of 0.2 arcsec 
(roughly 150 km). The Hi-C passband isolates a very narrow window 
in the extreme ultraviolet, centred at 193 Å. It is dominated by emission 
lines of Fe XII with a peak temperature of formation of 1.5 MK. AIA 
has an identical channel that images the full solar disk (Fig. 1; see also 
Supplementary Video 1). 
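
The conversion from angular to physical resolution quoted above can be checked with a small-angle calculation. The sketch below is an illustration, not part of the Hi-C analysis. It assumes the Sun is viewed from exactly 1 astronomical unit (the Earth-Sun distance in July is slightly larger, which would raise the result by under 2%), and `arcsec_to_km` and `AU_KM` are hypothetical names.

```python
import math

AU_KM = 149_597_870.7                     # astronomical unit in km
ARCSEC_RAD = math.pi / (180.0 * 3600.0)   # one arc second in radians


def arcsec_to_km(angle_arcsec, distance_km=AU_KM):
    """Physical length subtended by a small angle at the given distance."""
    return angle_arcsec * ARCSEC_RAD * distance_km


hic_km = arcsec_to_km(0.2)   # about 145 km, consistent with "roughly 150 km"
aia_km = arcsec_to_km(1.2)   # about 870 km, consistent with the quoted 900 km
```

One arc second at 1 AU corresponds to roughly 725 km on the Sun, so the fivefold to sixfold gain in angular resolution translates directly into resolving structures a few hundred kilometres smaller.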

The data from Hi-C show evidence of magnetic field braiding and 
axial twist in loops along their length, reconnection and consequent 
heating in the low corona, whereas the simultaneous images from AIA/ 
SDO ata resolution of 1.2 arcsec (900 km) do not resolve the braiding. 
We have studied the intensity of emission in a loop at several AIA 
wavelengths covering more than two orders of magnitude in temper- 
ature (Fig. 2), and found that the twist in the structures increases 
during the observations (Supplementary Fig. 2 and Supplementary 
Video 2). After the end of the Hi-C observations, a small flare observed 
by AIA peaks at the intersection of the converging strands (indicated 


b Hi-C 193 A: 11 July 2012 18:55:20 


Figure 1 | The 1.5-MK sun. a, The co-temporal 1-arcsec-resolution AIA full- 
sun image (11 July 2012, 18:55 ur) using the 193 A passband. The field of view 
for the Hi-C rocket flight is indicated by the box. b, The full Hi-C field-of-view 


image. The two boxes show the locations of the examples that are discussed in 
the text and shown in Fig. 2 (box 1) and Fig. 3 (box 2). The Hi-C experiment is 
described in Supplementary Fig. 1. 
Figure 2 | A coronal loop seen at several different coronal temperatures by AIA and Hi-C. a, AIA 304 Å: He II (0.1 MK), 18:55:20. b, AIA 171 Å: Fe IX/X (1 MK), 18:55:24. c, Hi-C unsharp masked image, 18:56:04. d, AIA 193 Å: Fe XII (1.5 MK), 18:55:19. e, Hi-C 193 Å: Fe XII (1.5 MK), 18:56:04. f, AIA 94 Å: Fe XVIII (6.3 MK), 18:55:26. Labels indicate the passbands, the dominant emitting ion and the peak formation temperature of the ion. The image in c was constructed by smoothing the original image (e) and subtracting the processed image from the original. This 'unsharp mask' version of the Hi-C data enhances the shapes of fine-scale structures in the image relative to the image background. All panels show a narrow magnetic arch that Hi-C resolves to be twisted along its length. The twisted loops converge and appear to intersect (arrow in c). A C1.7 flare centred on this intersection is seen in the AIA images about 3 min after the Hi-C flight (Supplementary Video 1). Scale bar, 10,000 km.
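
The Figure 2 caption describes the unsharp mask as smoothing the image and subtracting the smoothed copy from the original. A minimal pure-Python sketch of that operation follows. The box filter, its radius and the edge handling are assumptions made for illustration, because the text does not say which smoothing kernel was applied to the Hi-C data.

```python
def box_smooth(image, radius=1):
    """Smooth a 2D list of floats with a (2*radius+1)^2 box average.

    Near the edges the average is taken over in-bounds pixels only
    (an assumed edge treatment, not taken from the text).
    """
    rows, cols = len(image), len(image[0])
    smoothed = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            total, count = 0.0, 0
            for dr in range(-radius, radius + 1):
                for dc in range(-radius, radius + 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        total += image[rr][cc]
                        count += 1
            smoothed[r][c] = total / count
    return smoothed


def unsharp_mask(image, radius=1):
    """Return original minus smoothed image, which emphasises fine-scale structure."""
    smoothed = box_smooth(image, radius)
    return [
        [image[r][c] - smoothed[r][c] for c in range(len(image[0]))]
        for r in range(len(image))
    ]


# Synthetic 3x3 test image: one bright pixel on a flat background.
demo = [[1.0, 1.0, 1.0], [1.0, 10.0, 1.0], [1.0, 1.0, 1.0]]
enhanced = unsharp_mask(demo)
flat = unsharp_mask([[2.0] * 4 for _ in range(4)])
```

A uniform background maps to zero everywhere, while the isolated bright pixel stands out against its local average, which is the behaviour the caption attributes to the processed Hi-C image.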


by an arrow in Fig. 2c, Supplementary Fig. 2 and Supplementary Video 2).
At the location of the flare, strong (150 km s⁻¹) outflows of plasma from
the reconnection region are observed. These twisting loops are sheared
across a polarity inversion line in magnetic field in the photosphere (field
strength, 10° G; see magnetogram in Supplementary Fig. 3), along which
strong shearing flows are present (Supplementary Video 3). Using the
Extreme-ultraviolet Imaging Spectrometer observations from the Japan
Aerospace Exploration Agency's Hinode spacecraft, we have determined


Figure 3 | A time series from Hi-C data. a, Hi-C 193 Å, 18:53:28. b, Hi-C 193 Å, 18:53:45. c, Hi-C 193 Å, 18:54:13. Box A is centred on a location in the braided structure where multiple loops converge and wind about each other. In a, we trace along several of the features to guide the eye. As the time sequence proceeds, the loops unwind, as evidenced by the structural changes shown in box A, discernible by comparing c with f (see also Supplementary Video 2). Area B in b highlights a section of the loop ensemble that is found to unwind or simplify during the observation period. Scale bar, 10,000 km.

the density of the loops to be 10¹⁰ e⁻ cm⁻³ (where e⁻ denotes electrons) and the
peak temperature to be ~7 MK. The structure heats to this peak tempera- 
ture in 42 s, and then cools in roughly the radiative cooling time, 13 min. 
The example shown in Fig. 2 is found to generate many such transient 
events near the location indicated with the arrow (Fig. 2c). AIA data in 
several passbands show repeated impulsive events, but at the lower reso- 
lution of AIA only the increases in intensity are observable—the struc- 
tural changes in the loops remain unresolved. 

In a second example, Hi-C observed a small bundle of loops that are
wrapped or braided about each other along their length (Fig. 3). The 
AIA telescopes also observed this bundle of loops, but saw it as a single 
structure. The braided loop ensemble is visible over the temperature 
range 0.1-3 MK. Higher-temperature loops are rooted close to the 
footpoints of the braided loops, that is, the points at which the loops 
intersect the photosphere, and are observed in the soft-X-ray range at 
4-6 MK. In box A at the end of the braided structure (Fig. 3b), the
twist in the loops simplifies and eastward high-speed outflows at 100-150 km s⁻¹
along this fan of loops persist throughout the observations
(Supplementary Video 3). The bundle's axial field strength estimated
from observations is 500 G. We can determine the pitch angle of
the twist of the bundle by estimating the number of braids along the
length of the bundle (Fig. 3a) and, hence, the strength of the azimuthal
component ($B_\phi$) of the magnetic field. We count roughly five braids,
giving an azimuthal field of order 100 G. The amount of free energy is
of order $B_\phi^2 V/8\pi$, where $V$ is the volume of the bundle, which we
have estimated to be approximately 10'’ km’. This yields a total free 
energy of order 10°’ erg. 

At plasma temperatures of 1 MK, the speed of sound is ~100 km s⁻¹
and the observed outflows at ~100 km s⁻¹ are therefore indicative of
plasma pressure driving mass motion along the field as the plasma is 
heated locally and expands at the speed of sound into lower-pressure 
locations along the braided loops. As is the case for the first example, 
this braided bundle seems to be heated as the free energy stored in the 
twisting or braiding of the magnetic field is dissipated by means of 
reconnection. We have summed the emission observed by AIA from 
the bundle in area B in Fig. 3e to form light curves (Fig. 4; see also 
Supplementary Video 4). These curves show the duration of the heating 
and the initial sharp increase in emission and subsequent decay. The 
event lasts for about 12 min, for the return from the peak to the basal 
emission level. The radiated energy from this event can be calculated 
from the light curves and density and is found to be ~10°° erg: about 
~0.1% of the stored energy in the braided ensemble is converted into 
Figure 4 | The light curves for example two. The light curves from multiple
AIA channels for area B in Fig. 3b show the increase in intensity for the braided 
ensemble of loops just before the Hi-C flight. The start and end of the Hi-C flight are 
indicated by the vertical dashed lines in the plot. The intensities for all passbands 
decrease over a 10-min period. The heating process shown here repeats several 
times per hour over the 12 or more hours the structure is present in the AIA data. 
thermal energy to power the observed radiation. This braided bundle of
loops, seen in the AIA data as a single broader loop, is observed for over
12 h and brightens repeatedly over the AIA observation period.
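
The sound-speed figure used in the outflow argument for this example can be checked to order of magnitude with the ideal-gas expression for the adiabatic sound speed. The sketch below assumes an adiabatic index of 5/3 and a mean molecular weight of 0.6 for a fully ionized coronal plasma; neither value is given in the text.

```python
import math

K_B = 1.380649e-16    # Boltzmann constant, erg/K
M_P = 1.67262192e-24  # proton mass, g


def sound_speed_km_s(temperature_k, gamma=5.0 / 3.0, mu=0.6):
    """Adiabatic sound speed sqrt(gamma * k_B * T / (mu * m_p)), in km/s.

    gamma and mu are assumed values for a fully ionized plasma.
    """
    c_s_cm_per_s = math.sqrt(gamma * K_B * temperature_k / (mu * M_P))
    return c_s_cm_per_s / 1.0e5  # cm/s -> km/s


c_s = sound_speed_km_s(1.0e6)
print(f"sound speed at 1 MK ~ {c_s:.0f} km/s")
```

With these assumptions the result is about 150 km s⁻¹, the same order as the ~100 km s⁻¹ quoted for both the sound speed and the observed outflows.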
Several instances of braiding and loop-to-loop interaction were visi- 
ble to Hi-C during its brief flight: the two examples seen here are the 
clearest. These observations demonstrate that the active region corona, 
at a spatial scale of approximately 150 km, is replete with current-laden 
locations that provide sites for the dissipation of magnetic free energy. 
The observations show magnetic fields that are braided, and not sim- 
ply twisted. By reconnecting, presumably at current sheets between 
entwined flux strands, the braided loops release energy into the corona. 
The braiding is driven by the ubiquitous small-scale, convection- 
driven motion of the photospheric feet of the magnetic field, and thus 
provides a d.c. (that is, non-wavelike) energy source for the corona. 
Laser cooling of a semiconductor by 40 kelvin


Jun Zhang, Dehui Li, Renjie Chen & Qihua Xiong


Optical irradiation accompanied by spontaneous anti-Stokes emis-
sion can lead to cooling of matter, in a phenomenon known as laser
cooling, or optical refrigeration, which was proposed by Pringsheim
in 1929. In gaseous matter, an extremely low temperature can be
obtained in diluted atomic gases by Doppler cooling, and laser
cooling of ultradense gas has been demonstrated by collisional
redistribution of radiation. In solid-state materials, laser cooling
is achieved by the annihilation of phonons, which are quanta of
lattice vibrations, during anti-Stokes luminescence. Since the first
experimental demonstration in glasses doped with rare-earth metals,
considerable progress has been made, particularly in ytterbium-
doped glasses or crystals: recently a record was set of cooling to
about 110 kelvin from the ambient temperature, surpassing the
thermoelectric Peltier cooler. It would be interesting to realize
laser cooling in semiconductors, in which excitonic resonances
dominate, rather than in systems doped with rare-earth metals,
where atomic resonances dominate. However, so far no net cooling
in semiconductors has been achieved despite much experimental
and theoretical work, mainly on group-III-V gallium arsenide
quantum wells. Here we report a net cooling by about 40 kelvin in a
semiconductor using group-II-VI cadmium sulphide nanoribbons, 
or nanobelts, starting from 290 kelvin. We use a pump laser with a 
wavelength of 514 nanometres, and obtain an estimated cooling 
efficiency of about 1.3 per cent and an estimated cooling power of 
180 microwatts. At 100 kelvin, 532-nm pumping leads to a net 
cooling of about 15 kelvin with a cooling efficiency of about 2.0 
per cent. We attribute the net laser cooling in cadmium sulphide 
nanobelts to strong coupling between excitons and longitudinal 
optical phonons (LOPs), which allows the resonant annihilation 
of multiple LOPs in luminescence up-conversion processes, high 
external quantum efficiency and negligible background absorp- 
tion. Our findings suggest that, alternatively, group-II-VI semi- 
conductors with strong exciton-LOP coupling could be harnessed 
to achieve laser cooling and open the way to optical refrigeration 
based on semiconductors. 

Optical refrigeration has the advantages of compactness, freedom
from vibration and from the need for cryogen, and high reliability, and could
have applications in all-solid-state cryocoolers and athermal or self-
cooling lasers. Although net cooling has primarily been demon-
strated in a variety of rare-earth metal (REM)-doped glasses and
crystals, the laser cooling of direct-bandgap semiconductors,
for example gallium arsenide (GaAs), is more appealing because semi-
conductors allow more efficient pump light absorption, much lower
achievable cooling temperatures and direct integrability into electronic
and photonic devices. Nevertheless, the net cooling of a GaAs-
based semiconductor has inherent impediments due to high back-
ground absorption and low luminescence extraction efficiency, although
anti-Stokes up-conversion can be readily achieved.

According to Sheik-Bahae/Epstein (SBE) theory, the cooling efficiency, $\eta_c(h\nu, T)$, of optical refrigeration in semiconductors is given by

$$\eta_c(h\nu, T) = \eta_{\mathrm{exe}}\,\eta_{\mathrm{abs}}\,\frac{\nu_f(T)}{\nu} - 1 \qquad (1)$$

Here $\nu$ is the pump laser frequency, $T$ is the absolute temperature of the sample, $\nu_f(T)$ is the mean emission frequency and $h$ is Planck's constant; $\eta_{\mathrm{exe}}$ is the external quantum efficiency, written as $\eta_{\mathrm{exe}} = \eta_e W_{\mathrm{rad}}/(\eta_e W_{\mathrm{rad}} + W_{\mathrm{nr}})$, where $\eta_e$ is the luminescence extraction efficiency and $W_{\mathrm{rad}}$ and $W_{\mathrm{nr}}$ are respectively the radiative and non-radiative recombination rates; and $\eta_{\mathrm{abs}} = [1 + \alpha_b/\alpha(\nu, T)]^{-1}$ is the absorption efficiency, quantifying the percentage of photons absorbed that are engaged in cooling, where $\alpha_b$ is the background absorption coefficient and $\alpha(\nu, T)$ is the semiconductor absorption coefficient. Net laser cooling requires $\eta_c(h\nu, T) > 0$, which might be achieved in three related ways: having a large energy difference $\Delta E = h\nu_f(T) - h\nu$; having $\eta_{\mathrm{exe}}$ approach unity; and having $\eta_{\mathrm{abs}}$ approach unity through the minimization of the background absorption, $\alpha_b$.
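
To make the three conditions for net cooling concrete, the following sketch evaluates the SBE cooling efficiency in its standard form, $\eta_c = \eta_{\mathrm{exe}}\eta_{\mathrm{abs}}\,\nu_f/\nu - 1$, together with the two efficiency definitions given in the text. The function names and all numerical inputs are hypothetical and chosen only for illustration; they are not measured values from this work.

```python
def external_quantum_efficiency(eta_e, w_rad, w_nr):
    """eta_exe = eta_e * W_rad / (eta_e * W_rad + W_nr)."""
    return eta_e * w_rad / (eta_e * w_rad + w_nr)


def absorption_efficiency(alpha_b, alpha):
    """eta_abs = 1 / (1 + alpha_b / alpha)."""
    return 1.0 / (1.0 + alpha_b / alpha)


def cooling_efficiency(eta_exe, eta_abs, h_nu_f_ev, h_nu_ev):
    """SBE cooling efficiency; positive values mean net cooling."""
    return eta_exe * eta_abs * h_nu_f_ev / h_nu_ev - 1.0


# Hypothetical photon energies (eV): mean emission slightly above the pump.
H_NU_F, H_NU = 2.450, 2.412

ideal = cooling_efficiency(1.0, 1.0, H_NU_F, H_NU)   # both efficiencies at unity
lossy = cooling_efficiency(0.95, 1.0, H_NU_F, H_NU)  # 5% quantum-efficiency loss
print(f"ideal: {ideal:+.4f}, lossy: {lossy:+.4f}")
```

Because the ratio $\nu_f/\nu$ exceeds unity by only a few per cent, even a modest shortfall in either efficiency turns the sign of $\eta_c$ negative, which is why the text stresses near-unity $\eta_{\mathrm{exe}}$ and $\eta_{\mathrm{abs}}$.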

In the classical Pringsheim picture applied to solids, each cooling
cycle removes $\Delta E = h\nu_f(T) - h\nu \approx k_{\mathrm{B}}T$ during the thermalization of
cold electrons and holes, owing to absorption of various phonons. In
GaAs, single LOP-assisted transitions in band-tail absorption (Urbach
tail) have been proposed to facilitate laser cooling. It has also been
proposed that the band structure can be engineered, for example by
introducing donor-acceptor pairs, which modifies the density of states
and could lead to a larger ΔE, making the dependence on the other two
factors ($\eta_{\mathrm{exe}}$ and $\eta_{\mathrm{abs}}$) less important. However, experimentally that is
difficult to achieve because the doping density and binding energies of
both donors and acceptors have to be simultaneously optimized. We
find that strong exciton-LOP coupling in II-VI semiconductors such
as cadmium sulphide (CdS) can be harnessed to facilitate laser cooling
by the annihilation of one or more LOPs, leading to the removal of
several $k_{\mathrm{B}}T$ units of heat in each cooling cycle.

Figure 1a shows the anti-Stokes photoluminescence spectra of a CdS 
nanobelt excited by a 532-nm laser (bottom) and a 514-nm laser (top) 
at 294 K at three different pumping powers. Strong anti-Stokes photo- 
luminescence with a peak position of ~506 nm is identified, facilitated 
by resonant annihilation of multiple LOPs (one for the 514-nm laser 
and three for the 532-nm laser; $h\nu_{\mathrm{LO}} \approx 37$ meV, or 300 cm⁻¹). We note
that 514-nm pumping led to much stronger up-conversion, owing to 
stronger band-tail absorption. Systematic study of the anti-Stokes 
photoluminescence intensity as a function of laser power (Fig. 1b) 
indicates that two-photon absorption can be excluded at low-to- 
moderate power (below ~12 mW), at which our laser cooling experi- 
ments are conducted. Further analysis of the Stokes and anti-Stokes 
Raman spectra reveals that both the LOP anti-Stokes component and 
the 2LOP anti-Stokes component are stronger than their Stokes coun- 
terparts, suggesting that multiple-LOP annihilation dominates over 
the LOP creation process. This dominance of the anti-Stokes compo- 
nent can be readily achieved in CdS under resonant conditions, that is, 
when the laser energy is red-detuned from the exciton energy by roughly 
an integer multiple of $h\nu_{\mathrm{LO}}$ (Supplementary Fig. 3a). Following the
theory in ref. 22, we evaluate the Huang-Rhys factor, S, which quantifies 
the exciton-LOP coupling strength (Supplementary Information, part 2). 
As shown in Supplementary Fig. 3b, we find that S is considerably 
enhanced in CdS on the nanoscale, in good agreement with a recent 
report on ZnTe nanorods. Therefore, we speculate that the resonant
Figure 1 | Multiple-LOP-assisted up-conversion spectra of CdS nanobelts.
a, Room temperature anti-Stokes photoluminescence (ASPL) spectra of a 
single, 100-nm-thick CdS nanobelt excited by a 514-nm (top) and a 532-nm 
(bottom) laser at three power levels. b, Power dependence of anti-Stokes 


annihilation of one or more LOPs facilitates the up-conversion in CdS 
nanobelts owing to the resulting increase in the exciton-LOP coupling, 
leading to the removal of multiple $k_{\mathrm{B}}T$ units of heat in each cooling cycle.
The microscopic mechanism by which multiple LO phonons are actually 
involved in the up-conversion still remains to be addressed within the 
framework of higher-order exciton-LOP coupling. 
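
The phonon counts quoted for the two pump lines (one LOP for 514 nm, three for 532 nm) can be checked from the numbers in the text: an anti-Stokes peak near 506 nm and an LOP energy of about 37 meV. The sketch below is only an arithmetic check; the function names are illustrative, and the conversion constants are standard physical values rather than quantities from the paper.

```python
HC_EV_NM = 1239.841984      # h*c in eV nm
MEV_TO_WAVENUMBER = 8.065544  # 1 meV expressed in cm^-1


def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength in nm."""
    return HC_EV_NM / wavelength_nm


def lop_count(pump_nm, emission_nm=506.0, lop_mev=37.0):
    """Energy gap between emission peak and pump, in units of the LOP energy."""
    gap_mev = 1000.0 * (photon_energy_ev(emission_nm) - photon_energy_ev(pump_nm))
    return gap_mev / lop_mev


n_514 = lop_count(514.0)
n_532 = lop_count(532.0)
lop_wavenumber = 37.0 * MEV_TO_WAVENUMBER  # LOP energy in cm^-1
print(f"514 nm: {n_514:.2f} LOPs, 532 nm: {n_532:.2f} LOPs, "
      f"37 meV ~ {lop_wavenumber:.0f} cm^-1")
```

The up-conversion gap is roughly 38 meV for 514-nm pumping and 120 meV for 532-nm pumping, that is, close to one and three LOP quanta respectively, and 37 meV corresponds to about 300 cm⁻¹.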
Figure 2 | Surface plot of ΔE as a function of temperature and pump power.
a, Anti-Stokes photoluminescence spectra taken from 77 to 340 K using an 8.0-
mW, 532-nm laser. As the temperature is decreased from 340 to 180 K, the photo-
luminescence peak gradually blueshifts. Below ~180 K, donor-acceptor pair
(DAP) emission becomes pronounced. Both DAP and exciton emissions blueshift
marginally. b, Anti-Stokes photoluminescence spectra of the same nanobelt
excited by a 4.0-mW, 514-nm laser. Below 200 K, DAP emission peaks appear and
dominate the spectra. The spectra are offset vertically for clarity, with vertical
dashed lines indicating the wavelengths of the pump lasers and dashed boxes
indicating the first-order anti-Stokes (−520 cm⁻¹), first-order Stokes (520 cm⁻¹)
and second-order (980 cm⁻¹) Raman features from the silicon substrate in a and


photoluminescence pumped at 532 and 514 nm and at 294 K. Below ~12 mW,
the intensity scales linearly with power. Two-photon-absorption-induced
photoluminescence starts to take effect only at high power. $A_{\mathrm{ASPL}}$, integrated
peak area of anti-Stokes photoluminescence. a.u., arbitrary units.


Next we systematically evaluate the dependence of ΔE on pump
power and temperature (between 77 and 340 K) when pumped by
514- and 532-nm lasers (Methods Summary). Cooling is possible only
if ΔE is positive. Therefore, the sign and magnitude of ΔE at a defined
temperature have important implications for the cooling potential.
Figure 2a and Fig. 2b show the temperature evolution of anti-Stokes
b. c, ΔE as a function of temperature and power for 532-nm pumping. The maxi-
mum energy difference, of ~173 meV, is obtained at around 180 K. Below 180 K,
DAP emission peaks (a) appear and lead to pronounced redshifts of the mean
wavelength of luminescence. d, ΔE surface plot for 514-nm pumping. The maxi-
mum ΔE, of ~52.5 meV, is achieved at around 200 K. Below 200 K, the DAP
emission peaks (b) appear and dominate the spectra, leading to the redshift of the
mean wavelength of luminescence. Because the DAP energy levels resonate
approximately with the pump laser, the DAP emission is much stronger than the
exciton emission, which causes ΔE to be negative below 180 K, suggesting that no
net cooling is achievable using 514-nm pumping. In both c and d, Si Raman
features are excluded before the mean emission wavelength is evaluated.
photoluminescence spectra for laser excitation at 532 nm (8.0 mW)
and 514 nm (4.0 mW), respectively. Detailed peak assignments to the
excitons and donor-acceptor pairs (DAP) are given in Sup-
plementary Information, parts 2-3. By extracting the mean emission
frequency, we can obtain surface plots of $\Delta E = h\nu_f(T) - h\nu$ (Fig. 2c, d).
For the 532-nm pumping, a maximum ΔE of ~173 meV is reached at
around 180 K. With a further decrease in temperature, DAP emission
begins and leads to a reduction in ΔE. Over the whole temperature
range we investigate (77-340 K), ΔE remains positive, indicating that
net cooling is possible. For the 514-nm pumping, a maximum ΔE of
~53 meV is reached at around 200 K, below which DAP emission
begins, leading to a transition from a positive to a negative ΔE at
around 180 K. Therefore, it is impossible to achieve net cooling below
180 K when pumping at 514 nm, in good agreement with our cooling
experiments (discussed below). Over the power range we
investigate, no strong dependence of ΔE on power is observed, as
manifested by the vertically striped bands in the surface plots (Fig. 2c,
d). However, net laser cooling is possible only below 12 mW, because
the two-photon process will dominate the luminescence up-conversion
when a higher power is used (Supplementary Information, part 3.2).
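
The step from a measured spectrum to ΔE can be sketched as follows. The code assumes that the mean emission energy is the intensity-weighted average of the photon energy over the sampled spectrum, and it ignores the distinction between per-wavelength and per-energy spectral densities; the text does not spell out the exact weighting used. The spectrum values are synthetic, and any substrate Raman features would need to be removed beforehand, as noted in the Figure 2 caption.

```python
HC_EV_NM = 1239.841984  # h*c in eV nm


def mean_emission_energy_ev(wavelengths_nm, intensities):
    """Intensity-weighted mean photon energy (eV) of a sampled spectrum.

    Assumed definition: sum(E_i * S_i) / sum(S_i).
    """
    energies = [HC_EV_NM / w for w in wavelengths_nm]
    total = sum(intensities)
    return sum(e * s for e, s in zip(energies, intensities)) / total


def delta_e_mev(wavelengths_nm, intensities, pump_nm):
    """Delta E = h*nu_f - h*nu in meV; positive values permit cooling."""
    mean_ev = mean_emission_energy_ev(wavelengths_nm, intensities)
    return 1000.0 * (mean_ev - HC_EV_NM / pump_nm)


# Synthetic emission band centred near 506 nm (illustrative values only).
wavelengths = [500.0, 503.0, 506.0, 509.0, 512.0]
intensities = [1.0, 3.0, 5.0, 3.0, 1.0]

de_514 = delta_e_mev(wavelengths, intensities, 514.0)  # pump below the band
de_500 = delta_e_mev(wavelengths, intensities, 500.0)  # pump above the mean
print(f"dE(514 nm pump) = {de_514:+.1f} meV, dE(500 nm pump) = {de_500:+.1f} meV")
```

In this toy case a pump on the long-wavelength side of the emission band gives a positive ΔE, whereas a pump above the mean emission energy gives a negative ΔE and therefore cannot cool.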
We use a pump-probe luminescence thermometry (PPLT) tech- 
nique to measure the laser cooling with the sample image, set-up 
and time sequence as shown in Fig. 3. The main idea is to use the 
temperature dependence of the Stokes photoluminescence peak posi-
tion for non-contact local thermometry, essentially similar to diffe-
rential luminescence thermometry (Supplementary Information,
part 3.1). Nevertheless, in our laser cooling experiments the change
in temperature is sufficient for our high-resolution spectrometer to 
detect cooling from the shifting of the luminescence peak to shorter 


Figure 3 | Laser cooling pump-probe luminescence thermometry. a, False-
colour scanning electron microscope image of a single CdS nanobelt suspended
on a SiO₂/Si substrate. The laser cooling demonstration was conducted on the
suspended segment as shown in the inset. b, Measurement set-up with two laser
beams (pump and probe) aligned on the same spot. c, Time sequence of the
PPLT. After every 5 min of pumping (for example, using a 532-nm laser), the
pump laser was blocked momentarily and the probe Stokes photoluminescence
spectrum was acquired immediately, excited by a 20-µW, 473-nm laser. CCD,
charge-coupled device.
wavelengths. We use a solid-state, 532-nm laser and an argon ion laser
(variously 514, 502 and 488 nm) as pump lasers, and use a solid-state,
473-nm laser as a probe beam to measure the cooling or heating. The
CdS nanobelts are suspended on hole-patterned SiO₂/Si substrates to
decrease the thermal conduction through the substrates and to improve
the luminescence extraction efficiency (Methods Summary).

We first measure and evaluate the temperature calibration curves by
monitoring the peak shift of the photoluminescence spectrum of the
CdS nanobelt excited by the probe beam around the starting tempera-
tures, 290 and 100 K, respectively (Supplementary Fig. 5b, d). Then we
use these calibration curves to deduce the temperature change caused
by the laser cooling, ΔT. Figure 4a and Fig. 4d respectively show the
Stokes photoluminescence evolution on continuous 6.3-mW, 514-nm
laser pumping starting from 290 K and continuous 6.5-mW, 532-nm
laser pumping starting from 100 K. After every ~5 min of pumping,
the pump laser was momentarily blocked while each spectrum was
recorded. We observe a pronounced systematic blueshift on laser
pumping until a steady state is reached in ~30-40 min, suggesting
that a maximum net cooling is established for the given device struc-
ture and geometry. After the cooling is stopped (by switching off the
pump laser), the Stokes photoluminescence shifts to longer wave-
lengths, indicating a temperature increase (dashed curves in Fig. 4a,
d, shifted vertically for clarity). On the basis of the calibration curves,
ΔT is plotted in Fig. 4b and Fig. 4e, showing respective net laser cool-
ings of ~40 K for 514-nm pumping and ~20 K for 532-nm pumping,
both at 290 K. At 100 K, only 532-nm pumping can lead to a net laser
cooling of 15 K; 514-nm pumping cannot, owing to redshifting of
the mean emission wavelength at 100 K, in agreement with Fig. 2b,
d. Pumping at 488 and 502 nm leads to heating of the sample at all
temperatures in the range investigated. Complete data for all wave-
lengths and temperatures are shown in Supplementary Fig. 7.
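
The use of a calibration curve to turn a Stokes peak position into a temperature change can be sketched with simple linear interpolation. The calibration points below are invented for illustration and do not come from Supplementary Fig. 5; only the qualitative trend (a shorter peak wavelength at lower temperature) follows the text, and the interpolation scheme itself is an assumption.

```python
def temperature_from_peak(peak_nm, calibration):
    """Linearly interpolate temperature (K) from a peak wavelength (nm).

    calibration: list of (peak_nm, temperature_k) pairs measured with the
    probe beam alone. Raises ValueError outside the calibrated range.
    """
    points = sorted(calibration)
    for (w0, t0), (w1, t1) in zip(points, points[1:]):
        if w0 <= peak_nm <= w1:
            fraction = (peak_nm - w0) / (w1 - w0)
            return t0 + fraction * (t1 - t0)
    raise ValueError("peak wavelength outside the calibrated range")


# Hypothetical calibration: the peak redshifts as the sample warms.
calibration = [(505.0, 250.0), (506.0, 270.0), (507.0, 290.0)]

start_temperature = 290.0
measured_peak_nm = 505.5  # hypothetical peak position after pumping
delta_t = temperature_from_peak(measured_peak_nm, calibration) - start_temperature
print(f"inferred temperature change: {delta_t:+.1f} K")
```

A blueshift of the probe-excited peak maps to a negative temperature change, which is how the blueshifts seen during pumping are read as cooling.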

Knowledge of the absorption at the band tail is crucial to evaluating
the up-conversion and laser cooling. At the single-nanobelt level, the
direct measurement of optical absorption remains elusive. We have
devised a photoconductivity measurement to provide a direct quan-
tification of the absorption, which agrees well with the absorption
coefficient extracted from the photoluminescence spectra based on
the van Roosbroeck/Shockley equation (Supplementary Fig. 2b).
Figure 4c shows the Stokes photoluminescence (black curve) and the
corresponding photoconductivity gain spectra (blue curve and, on a
logarithmic scale, red curve) at 290 K. We find that at 532 nm, there is
about 6% absorption compared with the maximum absorption at
around 480 nm, whereas at 514 nm, the corresponding absorption
is ~36%. Figure 4f shows the normalized temperature change as a
function of pump laser wavelength at 290 K (blue curve), calculated
on the basis of the SBE model and an analysis of cooling power
(Supplementary Information, parts 4-6). The discrete data points give
the experimental temperature change, ΔT, for pumping by four
different laser lines; they are in good agreement with the theoretical
analysis except at 488 nm, where the theory underestimates
the actual heating considerably (Supplementary Information, part 6).
The final cooling power depends on both ΔE and band-tail absorption.
In CdS nanobelts at 290 K, 514 nm seems to be the optimal cooling
wavelength with the highest cooling power. The red regions in Fig. 4c, f
correspond to the laser cooling tail.

Two underlying factors make net cooling impractical in GaAs-based
semiconductors. These factors are high parasitic background absorp-
tion and low luminescence extraction efficiency, both of which lead to
heating and so represent the existing challenges not only in fun-
damental materials science but also in device engineering in laser
cooling. In CdS nanobelts, each cooling cycle can remove more
heat than in GaAs, as indicated by the ΔE analysis, because of the
extremely strong exciton-LOP coupling (the Fröhlich coupling con-
stant in bulk is 0.51 for CdS and 0.07 for GaAs, and is even higher on
the nanoscale, as shown in Supplementary Fig. 3b) and resonant anti-
Stokes up-conversion. When the pump laser (at, for example, 514 or
Figure 4 | Net laser cooling of CdS nanobelts. a, Evolution of PPLT spectra
starting from 290 K, pumped by a 514-nm laser with a power of ~6.3 mW.
Solid curves represent the cooling cycle, and the dashed curves represent the
warming up after the pump laser is switched off. Dashed curves are shifted
vertically for clarity. b, Temperature change, ΔT, versus time pumped by four
laser lines (532, 514, 502 and 488 nm), using data extracted from the PPLT
spectra (shown in a for 514-nm pumping; complete data are shown in
Supplementary Fig. 7) and corresponding calibration curve around 290 K
(Supplementary Fig. 5b). c, Photocurrent gain spectrum (linear scale, blue;
logarithmic scale, red) and Stokes photoluminescence spectrum (black) of a
single CdS nanobelt at 290 K. d, Evolution of PPLT spectra for another nanobelt


532 nm) moves to the Urbach tail of CdS, the background absorption 
due to free carriers can no longer be ignored. Nonetheless, our analysis 
shows that the free-carrier absorption in a CdS nanobelt is negli- 
gible. The luminescence extraction efficiency approaches unity as 
a result of the subwavelength thickness of the nanobelt (Supplemen- 
tary Information, part 4). We did not observe any emission at the 
longer-wavelength side of the pump laser wavelength (532 nm), sug- 
gesting no background absorption contribution due to defects or 
surface recombination. 

Compared with GaAs, CdS has a much smaller surface recombina-
tion velocity and a much lower Auger non-radiative recombination
coefficient, owing to the larger bandgap. Our previous data and
more in-depth analysis have indeed revealed that the radiative recom-
bination decay lifetime is of the order of tens to hundreds of picose-
conds, whereas the non-radiative decay lifetime is three orders of
magnitude larger than this at room temperature and is even greater
at low temperatures, justifying the near-unity external quantum effi-
ciency (>99%) in CdS nanobelts. Cooling efficiencies of ~4.8% at
290 K and ~2.0% at 100 K are estimated from the above analysis for
532-nm pumping, and a cooling efficiency of 1.3% at 290 K is esti-
mated for 514-nm pumping (Supplementary Information, part 5). The
cooling powers are estimated to be ~97 and 180 µW for 532- and
514-nm pumping at 290 K, respectively, with corresponding ther-
mal conductive loads of about 80 and 160 µW, in good agreement
pumped by a 6.5-mW, 532-nm laser starting from 100 K (complete data are
shown in Supplementary Fig. 7). We observe cooling and warming cycles
similar to those in a. e, Temperature change, ΔT, versus time pumped by four
laser lines (532, 514, 502 and 488 nm), using data extracted from the PPLT
spectra (shown in d for 532-nm pumping; complete data are shown in
Supplementary Fig. 7) and the corresponding calibration curve around 100 K
(Supplementary Fig. 5d). f, Summary of measured maximum ΔT (pink
squares) and theoretically calculated temperature change (blue curve)
normalized to pump power for different pump wavelengths at 290 K. In c and
f, the red regions correspond to the cooling zone.


with the cooling power. The radiative load is negligible according to 
a black-body model for both wavelengths, suggesting that the thermal 
conductive load dominates the heat dissipation (Supplementary 
Information, part 6). 

Our work demonstrates a substantial net laser cooling of a semi-
conductor CdS nanobelt by ~40 K from room temperature when
pumped by a 514-nm laser, and a net cooling by ~15 K from 100 K
when pumped by a 532-nm laser. In principle, the thermal conductive
load can be further minimized by device engineering to facilitate the
laser cooling of CdS nanobelts from ambient temperature to that of
liquid nitrogen. Our work opens the way to the design of materials
for the laser cooling of semiconductors with strong electron-LOP
coupling. It would be an immediate advance to test other II-VI
materials in the single-crystal, thin-film or nanomaterial morphology
for laser cooling.


METHODS SUMMARY 


The CdS nanobelts were synthesized in a home-built, vapour-transport chemical-
vapour-deposition system. Synthesis and characterization details have been pub-
lished elsewhere. The CdS nanobelt was deposited on indium foil substrates or
silicon substrates with a 100-nm oxide layer to evaluate the Raman spectrum
(Fig. 1) and the mean emission wavelength (Fig. 2), respectively. The thickness
of the nanobelt was measured by atomic force microscopy (Nanoscope III, Veeco
Instruments). We conducted experiments using six different laser powers, at every
~20-K interval between 77 and 340 K, to generate the ΔE surface plot. The laser


24 JANUARY 2013 | VOL 493 | NATURE | 507 

cooling was performed on nanobelts suspended across inverted-pyramid-shaped
holes (~3 µm wide) on a SiO₂/Si substrate etched using a potassium hydroxide
solution. The sample was mounted on the cold finger of a continuous-flow micro-
scopy cryostat. A PPLT technique was used to cool and measure the local tem-
perature variation of the nanobelts. The power of the probe beam of wavelength
473 nm was kept as low as 20 µW to eliminate the laser heating effect. Both the
pump beam (Ar ion laser, 488, 502 and 514 nm; Nd:YAG solid-state laser, 532 nm)
and the probe beam (solid-state laser, 473 nm) were collimated and focused
through a ×50 objective onto the single CdS nanobelt that crossed an etched
hole. All the spectra were collected using a confocal triple-grating spectrometer
(Horiba-JY T64000) in a backscattering configuration. With a 640-mm focal
length and 1,800-mm⁻¹ grating, the spectrometer has a spectral resolution of
~0.5 cm⁻¹, corresponding to a resolution of ~0.01 nm at a wavelength of
~500 nm. Full methods and more details about PPLT and the calibration can
be found in the Supplementary Information.
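
The correspondence between the spectrometer resolution in wavenumbers and in wavelength follows from differentiating λ = 1/ν̃, which gives |Δλ| = λ²|Δν̃|. A short check of the quoted numbers is sketched below; the function name is illustrative.

```python
def wavelength_resolution_nm(delta_wavenumber_per_cm, wavelength_nm):
    """Convert a wavenumber resolution (cm^-1) to a wavelength resolution (nm).

    Uses |d(lambda)| = lambda^2 * |d(nu_tilde)| with lambda expressed in cm.
    """
    wavelength_cm = wavelength_nm * 1.0e-7
    delta_lambda_cm = wavelength_cm ** 2 * delta_wavenumber_per_cm
    return delta_lambda_cm * 1.0e7  # cm -> nm


resolution_nm = wavelength_resolution_nm(0.5, 500.0)
print(f"0.5 cm^-1 at 500 nm ~ {resolution_nm:.4f} nm")
```

The result, about 0.0125 nm, agrees with the ~0.01 nm stated in the Methods Summary.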
The use of molecular spin state as a quantum of information for
storage, sensing and computing has generated considerable inter-
est in the context of next-generation data storage and communica-
tion devices, opening avenues for developing multifunctional
molecular spintronics. Such ideas have been researched exten-
sively, using single-molecule magnets and molecules with a metal
ion or nitrogen vacancy as localized spin-carrying centres for
storage and for realizing logic operations. However, the electronic
coupling between the spin centres of these molecules is rather
weak, which makes construction of quantum memory registers a
challenging task. In this regard, delocalized carbon-based radical
species with unpaired spin, such as phenalenyl, have shown
promise. These phenalenyl moieties, which can be regarded as gra-
phene fragments, are formed by the fusion of three benzene rings
and belong to the class of open-shell systems. The spin structure of
these molecules responds to external stimuli (such as light, and
electric and magnetic fields), which provides novel schemes for
performing spin memory and logic operations. Here we construct 
a molecular device using such molecules as templates to engineer 
interfacial spin transfer resulting from hybridization and magnetic 
exchange interaction with the surface of a ferromagnet; the device 
shows an unexpected interfacial magnetoresistance of more than 
20 per cent near room temperature. Moreover, we successfully 
demonstrate the formation of a nanoscale magnetic molecule with 
a well-defined magnetic hysteresis on ferromagnetic surfaces. 
Owing to strong magnetic coupling with the ferromagnet, such 
independent switching of an adsorbed magnetic molecule has been 
unsuccessful with single-molecule magnets. Our findings suggest
the use of chemically amenable phenalenyl-based molecules as a 
viable and scalable platform for building molecular-scale quantum 
spin memory and processors for technological development. 

The diversity and flexibility of molecular synthesis has given research- 
ers ample freedom to design functional molecules for spintronics. These 
include molecular magnets", spin-filter molecules'*, spin-crossover 
molecules!*, molecular batteries!’, molecular conductors’®, molecular 
switches’*, and spacer layers for organic spin valves'* and magnetic 
tunnel junctions’*”°. Using such synthetic techniques, we have designed 
a neutral planar phenalenyl-based molecule, zinc methyl phenalenyl 
(ZMP, C₁₄H₁₀O₂Zn; see Fig. 1a and Methods), that has no net spin. 
When these molecules are grown on a ferromagnetic surface, inter- 
face spin transfer causes a hybridized organometallic supramolecular 
magnetic layer to develop, which shows a large magnetic aniso- 
tropy and spin-filter properties”’. This interface layer creates a spin- 
dependent resistance and gives rise to an interface magnetoresistance 
(IMR) effect. 


We demonstrate IMR response in a vertical-junction molecular 
device consisting of a ZMP thin film grown on a (111) textured 
f.c.c. cobalt (Co) film surface (see Methods). Figure 1b shows a 
magnetotransport measurement on such a device with copper (Cu) 
as the top contact electrode over a 40-nm ZMP spacer layer. The device 
is cooled down to 4.2 K in a magnetic field of −550 Oe. The nonlinear 
current-voltage (I-V) characteristics of the device (see Supplemen- 
tary Information) suggest tunnelling as the dominant transport mech- 
anism. A large magnetoresistance signal, close to 25%, is observed 
when the Co magnetization switches at its coercive field. Here we only 
show the minor loop corresponding to the switching of one ferromag- 
netic electrode, while the second electrode is non-magnetic Cu. 
Furthermore, when the bottom Co electrode is replaced by Cu, there 
is no magnetoresistance signal. These observations imply that the 
magnetoresistance effect arises due to the switching of the Co electrode 
with respect to a hard magnetic layer, either at the interface or within 
the bulk of the ZMP film. Bulk magnetism in ZMP can be ruled out 
(see Supplementary Information), clearly pointing to an interfacial 
phenomenon as being responsible for the IMR effect. 

The IMR response is confirmed by performing magnetoresistance 
measurements on a standard magnetic junction structure, replacing Cu 
with Permalloy (Py): Co/ZMP (35 nm)/Py (Fig. 2a). Instead of seeing a 
‘standard’ magnetoresistance signal, a large IMR of 42% with sharp 
switching at the coercivity (H_c) of Co (±38 Oe) is observed. A low-field 
scan showed a second independent IMR loop corresponding to the H_c of 
the Py electrode (±3 Oe). The H_c of the ferromagnetic electrodes is 
confirmed from the anisotropic magnetoresistance (AMR) measure- 
ments of the electrodes. These results suggest that the magnetoresis- 
tance does not arise from conventional spin-conserved tunnelling/ 
transport through the organic barrier between the two ferromagnetic 
electrodes”’. This is further confirmed by inserting an ultrathin layer 
of Al₂O₃, 0.7 nm thick, at the bottom interface to create an interface-modified 
junction (Co/Al₂O₃/ZMP/Py) that showed a resistance 
change only when the Py magnetization was switched (Fig. 2b). 
Clearly, the ZMP has to be in direct contact with the ferromag- 
netic surface to observe IMR. The switching response of the inter- 
face layer was characterized in larger magnetic fields (Fig. 3a and b, 
and Supplementary Information). For Co/ZMP/Py at 4.2K, the 
bottom Co/ZMP interface layer switches at about +600 Oe to the 
low-resistance state (the switching field varied from device to device). 
After warming up to near room temperature (~250K), the device 
showed a large IMR of 22% with a lower switching field (~100 Oe) of 
the interface layer. 

A similar two-step switching magnetoresistance response has been 
reported in devices using one ferromagnetic electrode: (Ga,Mn)As/Al₂O₃/Al 
tunnel junctions’. The magnetoresistance in such tunnel 
devices is referred to as tunnelling anisotropic magnetoresistance 
(TAMR), which arises from the magnetization-dependent total density 
of states of the tunnelling electrons at the surface. The observation 
of TAMR has also been reported in organic spin-valve structures using 
epitaxial lanthanum strontium manganese oxide electrodes”. However, 
our experimental observations above suggest that the origin of IMR in 
our device is interfacial and different from that of TAMR. Though the 
details of the interface phenomenon need further study, we suggest an 
interface model, strengthened by ab initio calculations, that reasonably 
explains our experimental observations. Figure 3c and d shows the 
model: the interface magnetic layer comprises the hybridized Co surface 
layer with an adsorbed ZMP dimer. The adsorbed dimer forms a 
pair of magnetic and spin-filter molecules. The Co electrode with 
spin-polarized s-p electrons serves as the spin-polarizer, and the interface 
magnetic layer with spin-dependent interface resistance serves as the 
spin analyser, giving rise to IMR. 

Chemical modification of the ferromagnet/ZMP molecule interface 
arises owing to charge transfer and hybridization of the molecular 
orbitals with the d-orbitals of the ferromagnet, creating new hybrid 
metal-organic interface states that directly affect the electronic and 
magnetic properties of the adsorbed molecule™*. These interactions 
strongly depend on the specific molecular structure and morphology**”’, 
influencing the efficiency of spin polarization and spin injection 
at the interface. In the case of ZMP, the planar geometry of the 
molecule provides an ideal situation for strong orbital overlap leading 
to d-p hybridization effects. Spin-polarized first-principles calculations 
using density-functional theory (DFT) were carried out in zero 
magnetic field on such a hybrid metal-organic interface, comprising 
five Co(111) layers and either one or two molecular layers of ZMP on 
top (see Methods and Supplementary Information). 

The p_z atomic-type orbitals of the planar ZMP molecule hybridize 
strongly with the d-states (mainly with d_z², d_xz and d_yz orbitals) of the 
Co atoms, forming hybrid molecule-metal p_z-d interface states with 
bonding and anti-bonding characters™*. The calculated spin-resolved 
projected density-of-states (PDOS) is shown in Fig. 4a and b. The 
molecular energy levels of the first adsorbed ZMP molecule are broadened, 
becoming a component of the metallic hybrid molecule-surface 
states. In addition, the molecular layer acquires a net moment 
of 0.11 µB oriented anti-parallel to the moment of the hybridized 
surface layer of Co atoms, giving an anti-ferromagnetic (AFM) coupling 
(J_Co-ZMP < 0). The moment of the surface Co atoms is lowered to 
~1.7 µB per atom (bulk moment ~1.9 µB per atom). 

Figure 1 | Interface magnetoresistance effect. a, Molecular structure of zinc 
methyl phenalenyl (ZMP) in a neutral state with no net spin (top). Charge 
transfer processes through hybridization on the ferromagnet surface can 
change the chemical state of the phenalenyl moiety from neutral to an anionic 
radical (bottom) with net moment. b, Magnetoresistance measurements (data 
points) of a device with a single ferromagnetic electrode (Co (8 nm)/ZMP 
(40 nm)/Cu (12 nm)), measured at 15 mV after cooling the device to 4.2 K in a 
magnetic field (about −550 Oe). Blue and red data points refer to positive and 
negative field sweeps respectively. The solid lines are guides to the eye. The 
magnetoresistance loop corresponds to the switching of the Co magnetization, 
as shown in the two insets, with respect to the hard magnetic layer forming at 
the interface (yellow). Left and right insets show the Co magnetization pointing 
in the negative and positive direction, respectively. 

Figure 2 | Confirmation of IMR. a, Top panel, magnetoresistance 
measurements (black data points) on a Co (8 nm)/ZMP (35 nm)/Py (12 nm) 
device (at 25 mV) show independent IMR response of both the bottom Co/ZMP 
interface and the top ZMP/Py interface around zero field. Left inset, 
zoomed-in view showing low field scan. Right inset, IMR of bottom Co/ZMP 
interface versus applied bias shows a flat response under low bias conditions 
with a maximum of 50% in negative bias. Middle and bottom panels show AMR 
measurements of Co and Py electrode, respectively. b, Magnetoresistance 
measurements on an interface-modified junction, Co (8 nm)/Al₂O₃ (0.7 nm)/ 
ZMP (30 nm)/Py (12 nm), show 3% IMR (at 50 mV) due only to Py switching 
(data points). The blue and red curves refer to positive and negative field 
sweeps, respectively, and arrows show sweep direction. 

Figure 3 | Interface switching response and interface model. a, b, IMR 
measurements under high magnetic field on the device Co (8 nm)/ZMP 
(35 nm)/Py (12 nm): a, at 4.2 K showing large switching field (H_c ~ 600 Oe) of 
the interface layer; b, at 250 K, showing large IMR of 22% with reduction in 
switching field (~100 Oe). c, Ab initio model: top view (left) and side view 
(right) of the relaxed molecules’ configuration on a Co(111) surface; grey, 
carbon; red, oxygen; purple, zinc. The first-layer ZMP molecule (‘magnetic 
molecule’) adsorbs flat, with six α-carbons and the central carbon of the 
phenalenyl sitting directly on top of Co atoms, while the β-carbons of the 
phenalenyl sit in a bridge position between Co atoms. The molecular plane is 
~2.1-2.2 Å above the Co(111) surface (the C atoms are ~2.1 Å above, the O 
and Zn atoms are ~2.2 Å above). The second molecule (‘spin filter molecule’) 
sits in a staggered configuration over the first adsorbed molecule with a spacing 
of ~3.26 Å, forming a molecular π-dimer. d, Device model: an interface 
magnetic layer comprises the surface hybridized Co atoms with the molecular 
dimer on top, forming a magnetic molecule and a tunnel spin-filter molecule. 
Shown are relative magnetization directions of the interface magnetic layer and 
the bottom bulk Co at various field values (R, resistance; H, magnetic field; 
arrow labelled H_FC shows direction of field cooling). Arrows in the spin-filter 
molecule represent the two electron-spin energy levels. 

Figure 4 | Spin-filter effect and interface MAE. a, Spin-resolved PDOS of the 
p-states for the first-layer (green) and second-layer (purple) ZMP molecule of 
the relaxed interface system. Interface p_z-d hybridization creates 
spin-unbalanced electronic structure in the two molecules. b, Zoomed-in view of the 
PDOS in a in the energy interval [−1.5, 1.5] eV. The LUMO of the second ZMP 
molecule is spin-split by 0.14 eV, leading to a difference in the barrier height for 
the two spin channels. c, Calculation of MAE due to surface magnetocrystalline 
anisotropy for the organometallic supramolecule comprising seven hybridized 
Co atoms (gold) with the ZMP molecular dimer as a model system. The MAE of 
the supramolecule is calculated (table) for three different in-plane 
magnetization orientations with respect to the out-of-plane magnetization 
defined here as the c axis. 

Our calculations show that the distance between the phenalenyl rings 
in the magnetic molecule and the molecule above is ~3.26 Å, which is 
smaller than the sum of the van der Waals radii of the carbon atoms". 
This suggests a weak chemical bonding of the molecular π-dimer. The 
PDOS of the second-layer ZMP molecule resembles the molecular-type 
discrete energy levels above and below the Fermi level (E_F). Although 
the calculations show that the second ZMP molecule has no magnetic 
moment, the π-π interaction with the magnetic molecule creates a 
spin-unbalanced electronic structure. The first energy level above E_F 
(the lowest unoccupied molecular orbital (LUMO) level) is observed to 
be spin-split by ~0.14 eV (Fig. 4b), defining two different barrier 
heights for spin tunnelling injection: a lower one for spin-down 
electrons (Φ↓) of 0.73 eV and a higher one for spin-up electrons (Φ↑) of 
0.87 eV, leading to a spin-dependent interface resistance. It is well 
known that conventional spin-filter tunnelling of this type (using 
magnetic semiconductors such as EuS, EuO, and so on) can be highly 
efficient”. 
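
As a rough illustration of why a splitting of about 0.14 eV matters, the sketch below estimates the spin polarization of a tunnel current for the two barrier heights quoted above (0.73 eV and 0.87 eV). It is not the calculation used in this work: it assumes a simple rectangular barrier in the WKB limit with the free-electron mass, and the barrier thickness is a hypothetical parameter that the text does not specify.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # free-electron mass, kg (assumed effective mass)
EV = 1.602176634e-19     # joules per electronvolt


def wkb_transmission(barrier_ev: float, thickness_nm: float) -> float:
    """Transmission through a rectangular barrier in the WKB limit."""
    kappa = math.sqrt(2.0 * M_E * barrier_ev * EV) / HBAR  # decay constant, 1/m
    return math.exp(-2.0 * kappa * thickness_nm * 1e-9)


def spin_filter_polarization(phi_down_ev: float, phi_up_ev: float,
                             thickness_nm: float) -> float:
    """Polarization (T_down - T_up) / (T_down + T_up) of the tunnel current."""
    t_down = wkb_transmission(phi_down_ev, thickness_nm)
    t_up = wkb_transmission(phi_up_ev, thickness_nm)
    return (t_down - t_up) / (t_down + t_up)


# Barrier heights are taken from the text; the thicknesses are hypothetical.
for d_nm in (0.5, 1.0, 2.0):
    p = spin_filter_polarization(0.73, 0.87, d_nm)
    print(f"d = {d_nm:.1f} nm -> polarization = {p:.2f}")
```

Under these assumptions the polarization grows with barrier thickness, because the difference in barrier height enters the transmission exponentially.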

The independent switching of the interface magnetic layer close to 
room temperature implies a significantly large surface magnetic anisotropy 
energy (MAE ≳ 10kT). Because the molecule is composed of 
low-atomic-number elements that show weak spin-orbit coupling, the 
magnetic molecule alone cannot account for such large MAE. DFT 
study shows that the induced magnetic moment in the adsorbed ZMP 
molecule depends directly on the magnetic interaction with the surface 
Co layer. Therefore, the observation of IMR response suggests inter- 
face hybridization as the source of MAE. We expect an enhancement 
in MAE*” of the Co surface (K_sur, in units of energy per surface atom) and 
significant weakening of the magnetic exchange interaction between 
the surface Co layer and the layers below it (J⊥). Calculations of MAE 
were performed on a model organometallic supramolecule, comprising 
seven Co atoms with the molecular dimer on top. The calculated 
MAE for different in-plane magnetization orientations is listed in 
Fig. 4c, showing a large enhancement of in-plane K_sur ≈ 180 µeV 
per surface Co atom as compared to bulk anisotropy energy of 
19 µeV per atom”. This enhancement in K_sur is in line with our 
experimental observations. Further, the anisotropy tends to be uniaxial, with 
the easy axis along the field cooled orientation (see Supplementary 
Information). 

Magnetic exchange interactions at the ferromagnet surface can be 
weaker than within the bulk. Owing to reduction in the moment and 
fewer magnetic neighbours at the interface, the interatomic magnetic 
exchange coupling (J) can reduce considerably, and become comparable 
to or even lower than the surface anisotropy”: that is, 
J/K_sur ≲ 1. In such cases, the bulk magnetization can switch before 
the surface magnetization**. For bulk f.c.c. Co, J is computed to be 
~8.54 meV per atom, consistent with the values reported by others” 
(see Supplementary Information). For a hybridized Co surface this 
value is reduced to 1.34 meV per Co atom, corresponding to a 70% 
reduction in J⊥ compared to a clean Co surface (see Supplementary 
Information for J⊥ calculation). The strong effect of interface hybridization 
on the properties of the Co surface layer is clearly evident from 
these studies, signifying a large increase in the ratio K_sur/J⊥ compared 
to the bulk (K/J ~ 0.002)°*. This phenomenon explains the switching 
response of the interface magnetic layer responsible for the observed 
IMR. Deeper understanding of the interface mechanisms responsible 
for magnetic switching and the occurrence of IMR is a subject for 
future studies. 
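
The size of this change can be checked with simple arithmetic. The sketch below computes the anisotropy-to-exchange ratio for the bulk and for the hybridized surface from the values quoted above. It assumes that the anisotropy energies (19 and 180 per atom) are in µeV, a reading that reproduces the quoted bulk ratio of about 0.002, and it pairs the surface anisotropy with the reduced exchange value of 1.34 meV as an illustration only.

```python
MEV = 1e-3  # one millielectronvolt, in eV
UEV = 1e-6  # one microelectronvolt, in eV


def anisotropy_to_exchange_ratio(k_ev: float, j_ev: float) -> float:
    """Ratio K/J of anisotropy energy to exchange coupling (both in eV)."""
    return k_ev / j_ev


# Bulk f.c.c. Co: K = 19 ueV per atom (assumed unit), J = 8.54 meV per atom.
bulk_ratio = anisotropy_to_exchange_ratio(19 * UEV, 8.54 * MEV)

# Hybridized surface: K_sur = 180 ueV per atom (assumed unit), J = 1.34 meV.
surface_ratio = anisotropy_to_exchange_ratio(180 * UEV, 1.34 * MEV)

print(f"bulk K/J     = {bulk_ratio:.4f}")
print(f"surface K/J  = {surface_ratio:.3f}")
print(f"enhancement  = {surface_ratio / bulk_ratio:.0f}x")
```

With these inputs the ratio rises from roughly 0.002 in the bulk to roughly 0.13 at the hybridized surface, an increase of about a factor of 60.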

In order to use the adsorbed magnetic molecule as a source of 
information or a magnetic ‘bit’, one needs to be able to independently 
switch the moment of the adsorbed molecule. The use of phenalenyl- 
derived molecules provides a unique advantage in this regard. Using 
chemical synthesis, one could tune the chemical state (that is, cation, 
radical or anion) as well as the topological state of the phenalenyl 
moiety in the molecule, and control the charge transfer and hybridization 
effects with the ferromagnet surface. In the case of ZMP, the 
observation of an asymmetric magnetoresistance behaviour near room 
temperature (Fig. 3b) suggests a weakening of magnetic exchange 
coupling J_Co-ZMP, leading to independent magnetization switching of 
the magnetic molecular dimer from parallel to anti-parallel alignment 
and vice-versa with respect to the hybridized surface Co layer. The 
above results demonstrate the potential for engineering the interface 
magnetic exchange interaction at the molecular level to control the 
independent hysteretic response of the adsorbed magnetic molecule. 
In addition, the IMR response provides a robust mechanism to probe 
and manipulate the spin state of the magnetic molecule, paving the way to 
the development of room temperature molecular memory devices. 
Furthermore, the spin structure of the phenalenyl ligands can take 
the role of bus qubits’ in tailor-made molecular networks for realizing 
future molecular quantum information processing systems. 


METHODS SUMMARY 

Synthesis of ZMP. (ZMP has CCDC structure no. 859902.) As a starting material 
9-hydroxy-1-oxophenalene was synthesized and dissolved (0.196 g, 1.0 mmol) in 
tetrahydrofuran (20 ml) and ZnMe₂ (1.2 M in toluene; 1.0 ml, 1.2 mmol) was 
added. The resulting solution was concentrated under reduced pressure and kept 
at −20 °C for a day to develop suitable crystals (see Supplementary Information 
for details). 
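
The quantities in the synthesis can be cross-checked with a short calculation. The sketch below assumes the molecular formula C13H8O2 for 9-hydroxy-1-oxophenalene and standard atomic masses; both are supplied here for illustration and are not stated in the text.

```python
# Standard atomic masses in g/mol (rounded values, supplied for illustration).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def molar_mass(composition: dict) -> float:
    """Molar mass in g/mol from an element -> atom-count mapping."""
    return sum(ATOMIC_MASS[element] * count
               for element, count in composition.items())


# 9-hydroxy-1-oxophenalene, assumed formula C13H8O2.
ligand_molar_mass = molar_mass({"C": 13, "H": 8, "O": 2})
ligand_mmol = 0.196 / ligand_molar_mass * 1000.0  # 0.196 g of ligand

# Organozinc reagent: 1.2 M solution, 1.0 ml added (mol/l x ml = mmol).
zinc_mmol = 1.2 * 1.0

print(f"ligand molar mass: {ligand_molar_mass:.1f} g/mol")
print(f"ligand amount:     {ligand_mmol:.2f} mmol")
print(f"zinc reagent:      {zinc_mmol:.1f} mmol "
      f"({zinc_mmol / ligand_mmol:.1f} equivalents)")
```

The result agrees with the stated 1.0 mmol of ligand and shows the zinc reagent in a slight excess of about 1.2 equivalents.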

Device preparation. The thin film devices were fabricated on pre-cleaned glass 
substrates in a high vacuum thermal deposition chamber (10⁻⁷ torr) with in situ 
shadow masking. The Co bottom electrode was grown on cooled substrates (80 K) 
using electron-beam evaporation. The ZMP organic layer and top electrodes 
were subsequently grown at room temperature. The junction area was 200 µm 
× 200 µm. The device was covered by a 10-nm-thick Al₂O₃ capping layer. In each 
pump-down 72 junctions were made, and about 15 such junction sets were inves- 
tigated by varying the organic layer thickness and the interface as well as the type of 
top and bottom electrodes. 

DFT method. Spin-polarized first-principles calculations were performed using 
density-functional theory (DFT) by employing the generalized gradient approximation 
(PBE, for Co atom PBE+U, with U = 3.0 eV; see Supplementary 
Information for details) in a projector augmented plane-wave formulation as 
implemented in VASP code. The structural relaxations included the van der 
Waals forces in a semi-empirical approach. 
Sustainable bioenergy production from marginal lands in the US Midwest 


Ilya Gelfand’, Ritvik Sahajpal’**, Xuesong Zhang’”, R. César Izaurralde'***, Katherine L. Gross! & G. Philip Robertson’”° 


Legislation on biofuels production in the USA' and Europe”* 
is directing food crops towards the production of grain-based 
ethanol**, which can have detrimental consequences for soil car- 
bon sequestration’, nitrous oxide emissions’, nitrate pollution’, 
biodiversity’ and human health*®. An alternative is to grow 
lignocellulosic (cellulosic) crops on ‘marginal’ lands’. Cellulosic 
feedstocks can have positive environmental outcomes’**' and 
could make up a substantial proportion of future energy port- 
folios'*’*. However, the availability of marginal lands for cellulosic 
feedstock production, and the resulting greenhouse gas (GHG) 
emissions, remains uncertain. Here we evaluate the potential for 
marginal lands in ten Midwestern US states to produce sizeable 
amounts of biomass and concurrently mitigate GHG emissions. In 
a comparative assessment of six alternative cropping systems over 
20 years, we found that successional herbaceous vegetation, once 
well established, has a direct GHG emissions mitigation capacity 
that rivals that of purpose-grown crops (−851 ± 46 grams of CO₂ 
equivalent emissions per square metre per year (gCO₂e m⁻² yr⁻¹)). 
If fertilized, these communities have the capacity to produce about 
63 ± 5 gigajoules of ethanol energy per hectare per year. By contrast, 
an adjacent, no-till corn-soybean-wheat rotation produces 
on average 41 ± 1 gigajoules of biofuel energy per hectare per year 
and has a net direct mitigation capacity of −397 ± 32 gCO₂e m⁻² yr⁻¹; 
a continuous corn rotation would probably produce about 62 ± 7 
gigajoules of biofuel energy per hectare per year, with 13% less 
mitigation. We also perform quantitative modelling of successional 
vegetation on marginal lands in the region at a resolution of 
0.4 hectares, constrained by the requirement that each modelled loca- 
tion be within 80 kilometres of a potential biorefinery. Our results 
suggest that such vegetation could produce about 21 gigalitres of 
ethanol per year from around 11 million hectares, or approximately 
25 per cent of the 2022 target for cellulosic biofuel mandated by the 
US Energy Independence and Security Act of 2007, with no initial 
carbon debt nor the indirect land-use costs associated with food- 
based biofuels. Other regional-scale aspects of biofuel sustainability’, 
such as water quality'’"* and biodiversity’’, await future study. 
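
The regional figures can be sanity-checked with simple arithmetic. The sketch below uses the 21 gigalitres and 11 million hectares quoted above, together with the 80 Gl mandate and the conversion of 0.38 l of cellulosic ethanol per kilogram of biomass given elsewhere in this paper. The implied per-hectare biomass is a back-of-envelope inference, not a value reported by the authors.

```python
ETHANOL_GL_PER_YR = 21.0      # projected ethanol from marginal lands, Gl/yr
MANDATE_GL_PER_YR = 80.0      # 2022 mandate for non-grain ethanol, Gl/yr
AREA_MILLION_HA = 11.0        # marginal land area, million hectares
ETHANOL_L_PER_KG = 0.38       # cellulosic ethanol yield, litres per kg biomass

# Share of the mandate that the projected production would cover.
share_of_mandate = ETHANOL_GL_PER_YR / MANDATE_GL_PER_YR

# Average ethanol yield per hectare (1 Gl = 1e9 l, 1 million ha = 1e6 ha).
litres_per_ha = ETHANOL_GL_PER_YR * 1e9 / (AREA_MILLION_HA * 1e6)

# Implied harvested biomass per hectare, in Mg (1 Mg = 1000 kg).
biomass_mg_per_ha = litres_per_ha / ETHANOL_L_PER_KG / 1000.0

print(f"share of mandate:  {share_of_mandate:.0%}")
print(f"ethanol yield:     {litres_per_ha:.0f} l/ha/yr")
print(f"implied biomass:   {biomass_mg_per_ha:.1f} Mg/ha/yr")
```

The share comes out at about 26%, in line with the "approximately 25 per cent" quoted, and the implied harvest is about 5 Mg of biomass per hectare per year.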

US legislation mandates the annual production of 80 Gl of ethanol 
from non-grain sources by 2022’, which will represent ~25% of pro- 
jected best-case liquid transportation fuel needs by 2050. Although 
much of the land need could be satisfied by growing high-productivity 
feedstocks on fertile land now used for grain production", this strategy 
fails to consider the long-term need for fertile land to meet future food 
demands and the need to abate the indirect GHG impacts created 
when land now in grain ethanol production was originally diverted 
to biofuel use*’”'’. 

An alternative is to grow cellulosic crops on marginal lands’. 
Marginal lands are those poorly suited to field crops because of low 
crop productivity due to inherent edaphic or climatic limitations or 
because they are located in areas that are vulnerable to erosion or other 


environmental risks when cultivated. Often such lands are suitable for 
grasses, short-rotation tree crops or other perennial vegetation with 
persistent roots that are better adapted to low-nutrient, erodible or 
droughty soils. As long as the conversion of these lands to biofuel 
production avoids local carbon debt*’? and the replacement crop 
removes from the atmosphere more CO₂ than the pre-existing vegetation 
would have stored'””, cellulosic crops on marginal lands might 
provide substantial GHG emissions mitigation without the risk of 
indirect carbon costs due to displaced food and feed production”. 
Recent estimates place the indirect carbon costs for corn grain ethanol 
at 25-200 gCO₂e MJ⁻¹ (ref. 18), putting at risk much of the fossil fuel 
offset benefit of biofuels grown on arable land. 

Still uncertain, however, are the availability of non-arable marginal 
lands for cellulosic feedstock production and the resulting implications 
for GHG emissions. First, not all marginal lands are sufficiently close to 
a potential biorefinery to make transportation economical. Second, the 
productivity of marginal lands can vary because of low-fertility soils 
and location-specific interactions between soils and climate. Third, not 
all cellulosic feedstocks deliver equivalent climate benefits: soil carbon 
sequestration, N₂O fluxes and other sources of global warming can 
differ widely among different feedstocks”. Of available cellulosic 
feedstocks, only existing herbaceous vegetation—or purpose-grown 
feedstocks such as switchgrass seeded into existing vegetation—avoids 
local carbon debt associated with land conversion and planting**’””, 
but the potential productivity of this vegetation has been questioned” 
and there have been no long-term comparative analyses of its potential 
GHG benefits. 

Here we provide a 20-yr comparative assessment of direct GHG 
benefits, or GHG balances, for six alternative biofuel cropping systems, 
including native successional vegetation, replicated at the Kellogg 
Biological Station (KBS) Long-term Ecological Research (LTER) site 
in southwest Michigan, USA. We compare GHG emissions and the 
productivity of established successional vegetation with those of both 
annual and perennial crops at a single, moderate-fertility location, 
without consideration of indirect carbon costs. We then project the 
potential for early successional vegetation to produce biofuel on 
marginal lands across a ten-state region of the US Midwest using a 
well-established quantitative crop productivity model calibrated at 
KBS for successional vegetation and tested elsewhere (Methods 
Summary). Our intent is to provide a conservative and realistic esti- 
mate of the capacity of marginal lands in the US Midwest to produce 
cellulosic biofuel without carbon debt’® and without the added carbon 
cost of indirect land-use change”. 

Twenty-year patterns of GHG balances for the annual cropping 
systems (Table 1) show that without fossil fuel offsets, the conventionally 
managed system had a net release of GHG equivalents, whereas the 
no-till system accumulated enough soil carbon to offset the GHG costs 
of farming inputs and N₂O losses. In contrast to earlier results”, the 
poplar system was also a net source of GHG, owing mainly to soil 
carbon loss following harvest*’. This is probably due to microbial 
activity stimulated by warmer and wetter soil conditions caused by 
the absence of shade and by lower transpiration by resprouting 
trees before canopy closure’*. Conversely, alfalfa provided significant 
mitigation despite high N₂O fluxes. In the successional system, high 
rates of soil carbon accumulation and low N₂O emissions provided 
substantial mitigation under both unfertilized and fertilized conditions 
(Table 1). 

1Great Lakes Bioenergy Research Center, Michigan State University, East Lansing, Michigan 48824, USA. 2W.K. Kellogg Biological Station, Michigan State University, Hickory Corners, Michigan 49060, USA. 
3Joint Global Change Research Institute, Pacific Northwest National Laboratory and University of Maryland, College Park, Maryland 20740, USA. 4Department of Geographical Sciences, University of 
Maryland, College Park, Maryland 20740, USA. 5Department of Plant Biology, Michigan State University, East Lansing, Michigan 48824, USA. 6Department of Plant, Soil, and Microbial Sciences, Michigan 

Table 1 | Average annual GHG emissions for each candidate feedstock cropping system 

                                                            Farming inputs‡ 
Cropping system    Soil carbon*  N₂O†      CH₄†          N     P    K   Lime  Fuel  Seeds  Pest control  Net GHG balance 
Conventional CSW   0 ± 31        34 ± 6    −0.8 ± 0.1    32.7  0.4  1   3     13    7      7             98 ± 31 
No-till CSW        −122 ± 31     35 ± 4    −0.6 ± 0.1    32.7  0.3  1   4     9     7      16            −20 ± 31 
Alfalfa            −122 ± 92     33 ± 3    −1 ± 0.1      0     0.3  4   14    11    6      3             −52 ± 92 
Poplar             61 ± 153      17 ± 3    −0.8 ± 0.0    3.3   0    0   0     1     0      2             84 ± 153 
Successional       −397 ± 31     11 (s.e. illegible in source)  −1.1 ± 0.1  0  0  0  0  0  0  0          −388 ± 31 
Successional + N   −397 ± 31     27 ± 5    −1.2 ± 0.5    55.5  0    0   0     8     0      0             −308 ± 31 

Values shown are means ± s.e. (n = 4-6; gCO₂e m⁻² yr⁻¹), calculated without consideration of fossil fuel offset credits (Table 2) or the cost of indirect land use change for the period 1989-2009. For detailed 
information, see Supplementary Tables 1 and 6-10. CSW, corn-soybean-wheat rotation. 
*A/Ap horizon (surface soil to plow depth), from ref. 21; there were no significant soil carbon changes in lower horizons to 1 m. For calculation see Supplementary Information, equation (2). 
†n = 4. 
‡Rotational average. 

Including fossil fuel offset credits makes all six systems net direct 
sinks of atmospheric CO₂. Biorefinery yields of 0.43 l of bioethanol per 
kilogram of dry corn grain and 0.38 l of cellulosic ethanol per kilogram 
of biomass'”*! provide offsets of 280 ± 16 and 309 ± 10 gCO₂e m⁻² yr⁻¹ 
for conventional and no-till annual crops; 319 ± 35 and 514 ± 14 
gCO₂e m⁻² yr⁻¹ for the perennial poplar and alfalfa; and 463 ± 34 
and 624 ± 46 gCO₂e m⁻² yr⁻¹ for unfertilized and fertilized successional 
systems (Fig. 1a and Table 2). Including these offsets in the 
GHG balance (Fig. 1b) results in net direct mitigation that ranges from 
−247 ± 35 to −397 ± 32 gCO₂e m⁻² yr⁻¹ for the annual crops, from 
−241 ± 157 to −605 ± 93 gCO₂e m⁻² yr⁻¹ for the perennial crops 
and from −851 ± 46 to −932 ± 55 gCO₂e m⁻² yr⁻¹ for the successional 
communities (Table 2). 

Figure 1 | GHG balances of alternative cropping systems in southwest 
Michigan for biofuel feedstock production. a, Detailed GHG balance; b, net 
GHG balance. Bars represent different components of the GHG balance for a 
given system and were measured as soil emissions of N₂O and CH₄ (1989-2010), 
changes in soil carbon (1989-2001), and agronomic inputs and yields 
(used to calculate fossil fuel credits) from 1989 to 2009. Agricultural inputs 
include GHG emissions from farm machinery, agricultural chemicals and crop 
seed production. Fluxes of CH₄ are insignificant and are too small to be visible 
in the figure. Errors, s.e. (n = 4-6 replicates). CT, conventionally tilled system; 
NT, no-till system. 

In absolute terms, energy production was highest for the fertilized 
successional system (63 ± 5 GJ ha⁻¹ yr⁻¹) and lowest for poplar 
(30 ± 3 GJ ha⁻¹ yr⁻¹); other systems had intermediate values of energy 
production (Table 2). Had the annual no-till crop rotation been continuous 
corn with county-average yields rather than being 
corn-soybean-wheat at KBS, we estimate that the energy yield would have 
been ~60% higher (62 ± 7 GJ ha⁻¹ yr⁻¹) with a direct GHG balance 
~13% less negative (Table 2). Although the GHG emission intensity of 
the fertilized successional system was higher than that of an unfertilized 
system (−147 ± 14 versus −197 ± 18 gCO₂e MJ⁻¹ for a 90% 
harvest efficiency), it remains substantially lower than that of most other 
systems examined, with a direct GHG emission reduction of 105% 
relative to fossil fuels (Table 2). In contrast, the wheat and corn portions 
of the annual rotations had GHG emission reductions of only 23 and 
28%, respectively, although the equivalent soybean reduction was 
240% owing to the allocation of emissions to co-products (Table 2). 
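
The emission intensities in Table 2 appear to be consistent with dividing each system's net GHG balance by its energy production after converting units. The sketch below checks this for four systems using values quoted in the text and in Table 2. The formula is an inference from the reported numbers, not one stated by the authors, and the function name is chosen here for illustration.

```python
M2_PER_HA = 10_000   # square metres per hectare
MJ_PER_GJ = 1_000    # megajoules per gigajoule


def emission_intensity(net_balance_g_m2_yr: float,
                       energy_gj_ha_yr: float) -> float:
    """GHG emission intensity in gCO2e per MJ of biofuel energy."""
    grams_per_ha = net_balance_g_m2_yr * M2_PER_HA
    megajoules_per_ha = energy_gj_ha_yr * MJ_PER_GJ
    return grams_per_ha / megajoules_per_ha


# (net GHG balance, energy production, reported intensity)
SYSTEMS = {
    "No-till CSW": (-397, 41, -97),
    "Alfalfa": (-605, 51, -119),
    "Successional, 90% harvest": (-851, 43, -197),
    "Successional + N, 90% harvest": (-932, 63, -147),
}

for name, (balance, energy, reported) in SYSTEMS.items():
    computed = emission_intensity(balance, energy)
    print(f"{name}: computed {computed:.0f}, reported {reported} gCO2e/MJ")
```

The computed and reported values agree to within about 1 gCO₂e MJ⁻¹, which is within rounding of the inputs.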

The low mitigation potential of the poplar system is due to its rela- 
tively low productivity in addition to the soil carbon loss noted above. 
Additional fertilization®® and protection from a 2-yr defoliation by 
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Figure 2 | Above-ground biomass production of the successional system 
between 1989 and 2009 in 1-ha treatment plots. Inset, effects of nitrogen 
fertilizer (+N; 124kgN ha! yr ') and no fertilizer (0 N) in subplots. Errors, 
s.e. (1 = 6). 
Table 2 | Energy production, GHG emission intensity, GHG emission reduction, fossil fuel offset credit and net GHG balance for the cropping systems in Table 1

Cropping system     GHG emission intensity      Energy production     GHG emissions reduction    Fossil fuel offset credit   Net GHG balance
                    (gCO₂e MJ⁻¹)                (GJ ha⁻¹ yr⁻¹)        (%)                        (gCO₂e m⁻² yr⁻¹)            (gCO₂e m⁻² yr⁻¹)
Conventional CSW*   −65 ± 11                    38 ± 4                28 (C), 240 (S), 23 (W)    280 ± 16                    −247 ± 35
No-till CSW         −97 ± 9                     41 ± 2                31 (C), 240 (S), 25 (W)    309 ± 10                    −397 ± 32
No-till CC†         −56 ± 12                    62 ± 7                40 (C), 40 (C)             256 ± 52                    −344 ± 60
Alfalfa             −119 ± 19                   51 ± 1                111                        514 ± 14                    −605 ± 93
Poplar              −80 ± 52                    30 ± 3                112                        319 ± 35                    −241 ± 157
Successional‡       −260 ± 24 to −197 ± 18      24 ± 2 to 43 ± 3      114                        463 ± 34                    −851 ± 46
Successional + N‡   −208 ± 19 to −147 ± 14      35 ± 3 to 63 ± 5      105                        624 ± 46                    −932 ± 55

Values shown are means ± s.e. (n = 6), except for GHG emissions reductions, which are given as per cent relative to gasoline (94 gCO₂e MJ⁻¹). For each grain crop in rotation (C, corn; S, soybean; W, wheat), the numbers are for grain-based biofuels. For wheat straw, the GHG emissions reduction is 107%. Energy production was calculated from biomass yields (Supplementary Table 1). No corn or soybean stover was harvested for bioenergy production, owing to site-specific limitations to protect soil carbon, whereas wheat straw was harvested.

* Use of average corn yields for Kalamazoo county instead of KBS (Supplementary Methods) would increase fossil fuel offset credit for the rotation by 8% (303 ± 10 gCO₂e m⁻² yr⁻¹) and the overall net GHG balance by 9% (−270 ± 33 gCO₂e m⁻² yr⁻¹).

† Estimated values for a no-till corn-corn (CC) rotation with all grain and 17% of stover removed for bioethanol production, including a 10% continuous-corn-rotation yield penalty (Supplementary Table 1). Fossil fuel offset credits are from ref. 19; for calculation of the GHG emissions reduction, we used relative contributions of corn grain and corn stover ethanol to total GHG reduction. For net GHG balance, we assumed that soil emissions of N₂O and CH₄ and soil carbon sequestration rates were similar to measured rates in the no-till CSW rotation.

‡ The range for the successional system reflects a 55% contemporary and 90% potential harvest efficiency.



gypsy moths might have improved poplar yield in its first rotation. On
average, nitrogen fertilization boosted yields by 47% in the successional
system (Fig. 2, inset, and Supplementary Table 1) and provided a fossil fuel
offset credit that was 35% greater (Table 2). Although the carbon cost of
additional fertilizer and the N₂O flux it induced (Table 1) negate part of
the net GHG gain in the fertilized successional system (Fig. 1), the higher
productivity with fertilization allows for greater land-use intensity from
these systems with little additional carbon cost.

We used the Environmental Policy Integrated Climate (EPIC) model to
estimate the productivity of fertilized and unfertilized successional
systems (Table 1 and Fig. 1) across a ten-state region of the US Midwest.
The model was run on a fine-grained spatial scale (0.4-ha resolution) to
capture interactions between local soils and climates and to allow modelled
cells within an explicit geographic area to be aggregated by distance.
Simulated annual yields for unfertilized marginal lands in the region
averaged 6 ± 3 Mg ha⁻¹ yr⁻¹ (Supplementary Table 2). Simulated fertilization
at 68 kgN ha⁻¹ yr⁻¹ (kgN, kilograms of nitrogen) increased average yields by
36% to 8 ± 3 Mg ha⁻¹ yr⁻¹ (Supplementary Table 2), which is consistent with
measured fertilization response over grasslands in North America and with
measured estimates of on-farm switchgrass production within the region. Our
modelled estimates of switchgrass productivity correlate well with those of
successional vegetation (Pearson correlation coefficient, R² = 0.94;
Supplementary Fig. 1).

For the region, total modelled yield sums to 262 Tg yr⁻¹ (Supplementary
Table 3). Of this, about 58%, or 152 Tg yr⁻¹, is produced within 80 km of 35
potential biorefineries, each with a minimum feedstock requirement of
653 × 10³ Mg yr⁻¹ (Supplementary Information, equation (1)). This provides a
total fossil fuel offset of up to 44 TgCO₂e yr⁻¹ (Fig. 3, Supplementary
Fig. 2 and Supplementary Table 3). Total ethanol production for all
biorefineries sums to 20.8 Gl yr⁻¹ and represents ~27% of the cellulosic
ethanol mandated by the US Energy Independence and Security Act of 2007.
The 80-km collection radius is optimal for the current distribution of
marginal lands in the region. With no biorefinery minimum feedstock
requirement, maximum production would decline by ~5% were the collection
radius increased to 120 km (19.2 Gl yr⁻¹) or decreased to 40 km
(19.3 Gl yr⁻¹) (Supplementary Fig. 3). A fine-scale distributed network of
smaller processing facilities would relax the constraint imposed by the
80-km collection radius and increase collectable biomass from the 42% now
excluded.

Not all lands classified as marginal for crop production are idle; on
average about 8% of our biomass production areas are grazed (with a
range of 0-71%), which reduces our estimate of available biomass
from marginal lands by as much as 10% (Supplementary Table 3).
Depending on collection radii and biorefinery size restrictions, excluding
grazed areas from our analysis lowers our estimated biomass
production to 136 Tg yr⁻¹, for a fossil fuel offset of 39 TgCO₂e yr⁻¹
and an ethanol yield of 18.6 Gl yr⁻¹ (Supplementary Table 4).

In contrast to earlier estimates of potential cellulosic feedstock avail- 
ability in ref. 12, our analysis limits production to marginal lands and 
uses a process-based crop model operating on a fine-grained scale to 
estimate biomass yields, rather than estimating yields by statistical 
extrapolation to both marginal and cropped land. A state-by-state 
yield comparison reveals little relationship between the average yields 
predicted by the two approaches (R² = 0.008; Supplementary Table 5).
Overall, our approach predicts average yields that are 20% lower than 
those projected by ref. 12. 


Figure 3 | Potential biomass collection areas for cellulosic biorefineries
within ten US midwest states. Each circle represents an area of 80-km
radius with sufficient biomass resources to produce at least 89 Ml of
cellulosic ethanol per year, according to quantitative simulation of yields
from fertilized successional vegetation on non-forested marginal lands at a
resolution of 60 m × 60 m. County-scale totals of marginal lands biomass
production are shown in green.
Restoring or improving the productivity of marginal lands while
using them as biofuel cropping systems might allow them to contribute
even more to US energy and GHG mitigation goals in addition to
providing greater conservation benefits. Many marginal lands are
degraded and not managed, others simply have low productivity,
and still others are environmentally sensitive and perhaps in set-aside
programs such as the US Department of Agriculture’s Conservation
Reserve Program. Our results show that management allowing the
continued growth of perennial herbaceous species on these lands,
undisturbed but for harvest and fertilization, could avoid the carbon
costs of conversion, forgone sequestration and indirect land-use
change. Furthermore, management could promote the establishment
and growth of highly productive species such as future switchgrass
cultivars with high nitrogen-use efficiency. This would provide additional
GHG benefits in the form of increased energy yields, nitrogen
fertilizer savings and greater soil carbon sequestration, and might
concomitantly improve site fertility. Other aspects of biofuel
sustainability not considered here, such as landscape-scale water quality
and biodiversity benefits, might be readily co-managed.


METHODS SUMMARY 

GHG emissions quantification. We studied six ecosystems established in
southwest Michigan in 1989 at the KBS LTER site (http://lter.kbs.msu.edu). We
compared two 3-yr rotations of corn (Zea mays L.), soybean (Glycine max (L.) Merrill)
and wheat (Triticum aestivum L.) with all grain biomass and wheat straw used for
biofuel production (Supplementary Table 1). One system was chisel-plowed
(conventional tillage) and the other was permanent no-till. We also analysed three
perennial systems: alfalfa (Medicago sativa L.), short-rotation poplar trees
(Populus × canadensis Moench var. eugenei) and a successional old-field system
that has been burned biannually since 1997 and annually since 2002. In the
successional system, we compared biomass production with and without nitrogen
fertilization. For each system, we constructed total net GHG (CO₂e) budgets that
include the carbon costs of all farming operations and inputs from 1989 to 2009
(Supplementary Tables 6-8) and measured fluxes of N₂O and CH₄ for the period
1991-2010 (Supplementary Table 9), the annual accumulation of soil carbon to 1 m
for the period 1989-2001 (Supplementary Table 10) and fossil fuel offset credits
calculated from the biomass produced weighted by ethanol and biodiesel life cycle
impacts (Supplementary Table 11). None of our GHG budgets include estimates of
indirect GHG impact associated with diverting food crops to biofuel production.
The total net GHG budget for an equivalent no-till, continuous corn rotation was
calculated using average corn yields of Kalamazoo County, Michigan and fossil fuel
offset credits from ref. 19, together with the net GHG balance of the no-till system.
Regional productivity modelling. The EPIC model was evaluated for successional
vegetation growing on former agricultural fields with above-ground net primary
production data from the KBS and Cedar Creek, Minnesota LTER sites
(Supplementary Materials). Yields for biofuel feedstock production were then simulated
across a ten-state region at a resolution of 60 m × 60 m (Supplementary Table 3).


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


GHG sampling and calculation of CO₂ equivalents. The CO₂ equivalents
(gCO₂e m⁻² yr⁻¹) for N₂O and CH₄ emissions were calculated using IPCC
100-yr horizon factors (Supplementary Information, equations (3) and (4)).
Measurements of soil GHG fluxes (Supplementary Table 9) were performed
bi-weekly using a static-chamber method. Cumulative emissions of N₂O (grams
per hectare of nitrogen from N₂O) and CH₄ (grams per hectare of carbon from
CH₄) for each plot were determined by interpolating daily fluxes between
sampling days over the course of the entire growing season from first thaw in late
winter until soils froze in late autumn during 1989-2010.
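
The interpolation and conversion steps described above can be sketched as follows. This is a minimal illustration, not the authors' code: linear interpolation between sampling days is an assumption, the global warming potentials (298 for N₂O, 25 for CH₄) are assumed IPCC AR4 100-yr values rather than factors quoted in the text, and the sample data are hypothetical.

```python
import numpy as np

# Assumed 100-yr global warming potentials (IPCC AR4); the factors actually
# used are given in the Supplementary Information, not in this text.
GWP_N2O = 298.0
GWP_CH4 = 25.0
M2_PER_HA = 10_000.0


def cumulative_flux(sample_days, daily_flux):
    """Sum daily fluxes after linear interpolation between sampling days."""
    days = np.arange(sample_days[0], sample_days[-1] + 1)
    return float(np.interp(days, sample_days, daily_flux).sum())


def n2o_co2e(g_n2o_n_per_ha):
    """g N2O-N per hectare -> g CO2e per square metre (44/28: N to N2O mass)."""
    return g_n2o_n_per_ha * (44.0 / 28.0) * GWP_N2O / M2_PER_HA


def ch4_co2e(g_ch4_c_per_ha):
    """g CH4-C per hectare -> g CO2e per square metre (16/12: C to CH4 mass)."""
    return g_ch4_c_per_ha * (16.0 / 12.0) * GWP_CH4 / M2_PER_HA


# Hypothetical bi-weekly samples: day of year and g N2O-N per hectare per day.
total_n2o_n = cumulative_flux([100, 114, 128], [2.0, 4.0, 2.0])
total_co2e = n2o_co2e(total_n2o_n)
```

Summing interpolated daily values (rather than integrating with a trapezoid rule) mirrors the wording "interpolating daily fluxes between sampling days"; either choice would be defensible.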

Farming CO₂e calculation. Total GHG emissions associated with farming
(Supplementary Tables 6-8) were calculated as the sum of CO₂e emissions from
the production of fertilizers and herbicides and from farm agricultural machinery
fuel use. Calculations (Supplementary Information, equation (5)) were based on
actual field practice at the study sites, with average fuel use and production costs
from published tables and the LTER Narrative Agronomic Field Log
(http://lter.kbs.msu.edu/datatables/16). For perennial and successional systems, GHG
emissions were calculated on a yearly basis.

Calculation of fossil fuel displacement due to the use of renewable fuels. We 
used results of published analyses for the calculation of fossil fuel displacement due 
to the use of renewable fuels and their co-products, or fossil fuel offset credits, 
similar to the procedure explained in ref. 19. 

For carbon offsets due to biodiesel production, we used published results for the
GREET life cycle analysis model, which compares five different approaches for
crediting GHG emissions allocations to co-products: a displacement approach, an
allocation approach based on the energy value of co-products, an allocation
approach based on the market value of co-products and two hybrid approaches
that integrate the displacement and allocation methods. We used the approach
based on the energy value of co-products, which sums to 7.6 kgCO₂e per kilogram
of soybean diesel, or 193.9 gCO₂e MJ⁻¹ using a biodiesel energy yield of 34.5 MJ l⁻¹
and a diesel volumetric density of 0.88 g ml⁻¹ (refs 39, 40). Of this sum, 1.16 kgCO₂e
per kilogram of biodiesel is allocated to soy meal, 1.29 kgCO₂e kg⁻¹ to glycerin,
1.08 kgCO₂e kg⁻¹ to fuel gas (displacing natural gas), 0.76 kgCO₂e kg⁻¹ to heavy
oil, 0.20 kgCO₂e kg⁻¹ to propane fuel mix, 0.96 kgCO₂e kg⁻¹ to product gas,
0.99 kgCO₂e kg⁻¹ to light cycle gas and 1.15 kgCO₂e kg⁻¹ to clarified slurry oil.
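
The unit conversion in this paragraph can be checked with a short sketch. It only re-does the arithmetic with the numbers quoted above; the variable and function names are illustrative.

```python
BIODIESEL_LHV_MJ_PER_L = 34.5       # lower heating value quoted in the text
BIODIESEL_DENSITY_KG_PER_L = 0.88   # 0.88 g per ml

# Co-product allocations quoted above, in kgCO2e per kg of biodiesel.
allocations = {
    "soy meal": 1.16,
    "glycerin": 1.29,
    "fuel gas": 1.08,
    "heavy oil": 0.76,
    "propane fuel mix": 0.20,
    "product gas": 0.96,
    "light cycle gas": 0.99,
    "clarified slurry oil": 1.15,
}


def g_co2e_per_mj(kg_co2e_per_kg_fuel):
    """Convert kgCO2e per kg of biodiesel to gCO2e per MJ of fuel energy."""
    mj_per_kg = BIODIESEL_LHV_MJ_PER_L / BIODIESEL_DENSITY_KG_PER_L
    return kg_co2e_per_kg_fuel * 1000.0 / mj_per_kg


allocated_total = sum(allocations.values())  # close to the quoted 7.6
offset_per_mj = g_co2e_per_mj(7.6)           # close to the quoted 193.9
```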

Further, we substituted our measured emissions from agricultural inputs for the
39.4 gCO₂e MJ⁻¹ estimated by the GREET model, which are based on emissions of
278 gCO₂e kg⁻¹ for produced soybean grain (GREET, version 1.8d.1). More
specifically, GREET assumes farm energy use of 825.9 kJ per kilogram of soybean
grain produced (21,310 BTU per bushel); additionally, GREET assumes the
following farming inputs (per kilogram of soybean grain): 1.9 g N fertilizer, 5.6 g
P₂O₅, 11.2 g K₂O, 157.0 g CaCO₃, 0.5 g herbicides and 0.001 g insecticide. One
kilogram of soybean grain can produce 0.205 l of biodiesel (using conversion
factors as above), which will contain 7.1 MJ of energy. For conversion of
GREET assumptions to an areal basis, we assume an average production of
2.5 Mg ha⁻¹ (average soybean grain yield in Kalamazoo county, Michigan).
Thus, using our site soybean yields and GREET assumptions, we calculate, for
our site, emissions of 69.2 gCO₂e m⁻² during soybean farming (278 gCO₂e per
kilogram of soybeans × 2,489 kg soybeans per hectare).
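
As a rough check of the per-energy and areal conversions above, the arithmetic can be sketched as follows. All inputs are the values quoted in the paragraph; the names are illustrative, and the per-megajoule figure reproduces the quoted 39.4 only to within rounding of the inputs.

```python
GREET_FARM_G_CO2E_PER_KG_GRAIN = 278.0  # GREET farming emissions per kg soybean grain
BIODIESEL_L_PER_KG_GRAIN = 0.205        # litres of biodiesel per kg of grain
BIODIESEL_LHV_MJ_PER_L = 34.5           # lower heating value of biodiesel
SOY_YIELD_KG_PER_HA = 2489.0            # yield used in the areal conversion
M2_PER_HA = 10_000.0

# Energy content of the biodiesel obtained from 1 kg of grain (about 7.1 MJ).
mj_per_kg_grain = BIODIESEL_L_PER_KG_GRAIN * BIODIESEL_LHV_MJ_PER_L

# Farming emissions per unit of fuel energy (about 39 gCO2e per MJ).
farm_g_per_mj = GREET_FARM_G_CO2E_PER_KG_GRAIN / mj_per_kg_grain

# Farming emissions on an areal basis (about 69.2 gCO2e per square metre).
farm_g_per_m2 = GREET_FARM_G_CO2E_PER_KG_GRAIN * SOY_YIELD_KG_PER_HA / M2_PER_HA
```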

Using measured farming activities (Supplementary Tables 6-8), we calculate
that overall emissions from farming during the average soybean rotational year in
KBS LTER fields were lower than assumed in GREET and summed to 10.2 and
11.1 gCO₂e m⁻² yr⁻¹ for conventionally tilled and no-till systems, respectively
(Supplementary Table 11). Using a similar procedure to calculate GHG emissions
per energy unit (gCO₂e MJ⁻¹; see above), we estimate that KBS LTER emissions
sum to 7.1 and 6.8 gCO₂e MJ⁻¹ of biodiesel energy for conventionally tilled and
no-till systems, respectively. Thus, we added the difference between emissions
from agricultural inputs in both cases (32.3 and 32.6 gCO₂e MJ⁻¹) to the carbon
offset of biodiesel, calculated by the GREET model. This makes our fossil fuel offset
value greater than those estimated by GREET, that is, 225.6 and 225.9 gCO₂e MJ⁻¹
of biodiesel energy produced for conventionally tilled and no-till systems,
respectively (Supplementary Table 11). Average soybean yields for our sites were
between 2.0 and 2.3 Mg ha⁻¹ yr⁻¹, or ~217 g m⁻² yr⁻¹ (Supplementary Table 1),
which could produce ~39 g of biodiesel per square metre, or ~1.5 MJ m⁻², and
thus could offset ~339 gCO₂e m⁻² yr⁻¹.

For carbon offsets associated with the production of corn grain ethanol, we used
a comparison between results of life cycle analysis by the EBAMM and GREET
models, with the latter using dry milling for corn ethanol production. We
estimate the CO₂e cost of producing, distributing and combusting fossil gasoline at
94.0 gCO₂e per megajoule of gasoline energy, calculated from the EBAMM model
as reported in ref. 41. For estimation of the CO₂e costs of production and
distribution of corn ethanol, we used management data from our site for the agricultural
phase and the corn ethanol scenario from GREET (version 1.8.d.1).


Using measured farming activities (Supplementary Tables 6 and 7), we calculate
that overall emissions from farming during the average corn rotational year at KBS
LTER fields are 72.9 and 73.0 gCO₂e m⁻² yr⁻¹ for conventionally tilled
and no-till systems, respectively (Supplementary Table 8), or 37.7 and
34.7 gCO₂e MJ⁻¹, respectively, on the basis of KBS yields (Supplementary
Table 1) and agricultural inputs used for the Fuel_Prod_TS module of the
GREET model. In GREET, the emissions associated with the agricultural part of
corn ethanol production sum to 36.1 gCO₂e MJ⁻¹ (GREET, version 1.8.d.1); the
differences between GREET and measured emissions are associated with
differences in agricultural inputs such as fertilizers and energy and assumed crop yield.
For example, GREET assumes 15.3 gN fertilizer per kilogram of corn, whereas our
measured values are 21.4 gN per kilogram of corn. Assumed P and K use, on the
other hand, was larger in the GREET model than at KBS: 5.8 versus 6.8 gP per
kilogram of corn and 0.3 versus 1.4 gK per kilogram of corn (GREET versus KBS).
Agricultural lime usage also was higher in GREET than at KBS: 45.3 versus 1.3 g
per kilogram of corn. Energy-use assumptions in GREET were similar to measured
values: ~0.4 and ~0.5 MJ per kilogram of corn, respectively. Finally, the actual
yields at KBS were lower than assumed in GREET (9.9 versus 6.1 Mg of corn
per hectare for GREET and KBS, respectively), owing to rainfall variability over the
period examined; irrigated yields at KBS are ~13 Mg of corn per hectare
(Supplementary Materials). We assumed no emissions associated with land-use
change for our long-cropped, well-equilibrated KBS soils. All emissions associated
with the ethanol production phase in GREET were accounted for without
adjustments (except yield differences).

We used these estimates for calculations of overall emissions from corn grain
ethanol production, and adjusted CO₂e offsets generated by the GREET model
accordingly, similar to estimates of CO₂e emissions from the production of
soybeans (above). We thus estimate that overall emissions from corn-based ethanol
for KBS are 68.1 and 65.1 gCO₂e per megajoule of ethanol energy for
conventionally tilled and no-till systems, respectively (37.7 and 34.7 gCO₂e MJ⁻¹ for the
agricultural part and 30.4 gCO₂e MJ⁻¹ for the biorefinery part), after adjustment
of assumed GREET inputs to measured KBS inputs. In comparison, overall
emissions calculated by GREET (version 1.8.d.1) were 66.5 gCO₂e MJ⁻¹: 36.1 and
30.4 gCO₂e MJ⁻¹ for the agricultural and biorefinery parts of the cycle,
respectively. This makes the estimated fossil fuel offset credits for KBS very similar to those
estimated by GREET for average corn grain ethanol production across the United
States. This analysis accounts for CO₂e emissions from farm operations,
transportation and biorefinery operations. Thus, we estimate the net reduction in CO₂e
emissions as the difference between emissions from the production, distribution
and combustion of fossil gasoline (94.0 gCO₂e MJ⁻¹; EBAMM, release 1.1) and
the distribution and production of corn ethanol, for net savings of 25.9 and
28.9 gCO₂e per megajoule of corn ethanol energy produced by KBS fields under
conventional and no-till management, respectively.
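
The bookkeeping in this paragraph reduces to two sums and two differences, sketched below with the quoted figures. The dictionary keys and variable names are illustrative only.

```python
GASOLINE_G_CO2E_PER_MJ = 94.0      # fossil gasoline, from EBAMM as quoted
BIOREFINERY_G_CO2E_PER_MJ = 30.4   # biorefinery part, from GREET as quoted

# Agricultural emissions measured at KBS, gCO2e per MJ of ethanol.
farming = {"conventional": 37.7, "no-till": 34.7}

# Total life cycle emissions of corn ethanol per MJ for each tillage system.
ethanol_total = {k: v + BIOREFINERY_G_CO2E_PER_MJ for k, v in farming.items()}

# Net savings relative to gasoline per MJ of ethanol energy.
net_savings = {k: GASOLINE_G_CO2E_PER_MJ - v for k, v in ethanol_total.items()}
```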

For ethanol produced from wheat grain, we adjusted GREET model inputs to
those actually measured at KBS for wheat farming, and estimated overall emissions
from the agricultural phase in GREET to be 37.8 and 35.6 gCO₂e per megajoule of
ethanol energy for conventionally tilled and no-till systems, respectively. Although
the GREET model has no wheat scenario, we assumed that the use of measured
agricultural inputs and wheat yields provides a good approximation of the
emissions associated with wheat production, especially because other farming activities
(travel of workers to farm fields, labour input, machinery production and so on)
are similar in wheat and corn farming. Furthermore, we assumed that the ethanol
distillation phase of corn ethanol does not differ significantly from that of wheat
ethanol.

We used GREET-generated CO₂e emissions per megajoule of energy to
estimate potential fossil fuel offset credits when wheat grain is used for bioethanol
production, that is, 25.8 and 28.0 gCO₂e MJ⁻¹ for conventionally tilled and
no-till systems, respectively. Although we assumed that the biorefinery part of wheat
ethanol production is similar to that of corn ethanol production, the energy value
of wheat ethanol co-products, dried distillers grains with solubles, is on average
17% lower than that of corn dried distillers grains with solubles. Thus, we
reduced the fossil fuel offset calculated from GREET for wheat grain ethanol
further by 17%, with final offset values of 21.4 and 23.2 gCO₂e MJ⁻¹ for
conventionally tilled and no-till systems, respectively. For cellulosic ethanol from wheat
straw, we estimated net CO₂e offsets of 108.3 gCO₂e per megajoule of ethanol
energy for both types of management, after allocation of all agricultural emissions
to wheat grain production.
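
The 17% co-product adjustment for wheat grain ethanol can be reproduced directly from the quoted offsets; the names below are illustrative.

```python
CO_PRODUCT_REDUCTION = 0.17  # wheat DDGS energy value is 17% lower than corn DDGS

# GREET-based offsets for wheat grain ethanol before adjustment, gCO2e per MJ.
unadjusted = {"conventional": 25.8, "no-till": 28.0}

# Offsets after reducing each value by 17%.
adjusted = {k: v * (1.0 - CO_PRODUCT_REDUCTION) for k, v in unadjusted.items()}
```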

For the estimation of fossil fuel offsets for scenarios with cellulosic ethanol
production from poplar, alfalfa and herbaceous vegetation (successional and
successional + N), we similarly used comparisons of emissions assumed in
GREET with those measured at our site. Emissions associated with agricultural
inputs assumed in GREET, 6.6 and 18.6 gCO₂e MJ⁻¹ for the production of tree
and herbaceous vegetation, respectively, were different from those measured at
our sites: 2.3, 8.0, 1.0 and 9.9 gCO₂e MJ⁻¹ for poplar, alfalfa, successional and
successional + N systems, respectively (Supplementary Table 8). Furthermore,
we adjusted GREET-generated fossil fuel carbon offset values for measured
emissions. We estimate that production of 1 MJ of energy from cellulosic feedstock in
the alfalfa scenario would save 103.3 gCO₂e and that the savings would be
105.0 gCO₂e MJ⁻¹ in the poplar scenario and 107.3 and 98.4 gCO₂e MJ⁻¹ in the
successional and successional + N scenarios, respectively (Supplementary
Table 8). Under the original GREET assumptions, ethanol production would save
89.7 gCO₂e MJ⁻¹ from herbaceous vegetation and 100.7 gCO₂e MJ⁻¹ from the
farmed woody biomass.

To calculate the potential energy produced from biofuels, we used average KBS
LTER yields, harvestable biomass from the successional treatment
(Supplementary Table 1) and ethanol energy content (lower heating value assumed to be
21.1 MJ per litre of ethanol). We assumed a biorefinery yield of 0.43 l of bioethanol
per kilogram of dry corn and wheat grain; 0.38 l of bioethanol per kilogram of dry
wheat straw, corn stover and cellulosic biomass; and 0.21 l of biodiesel per
kilogram of soybean grain. The energy-equivalent amounts of fossil fuel use
avoided due to the use of biofuels were calculated (Supplementary Information,
equation (6)) using lower-heating-value energy contents (34.5 and 21.1 MJ l⁻¹ for
biodiesel and ethanol, respectively) and the specific densities of each fuel.
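
A simplified version of the energy calculation might look like the sketch below. It assumes energy is simply dry yield times conversion yield times lower heating value; the paper's equation (6) is in the Supplementary Information and may include further terms. The example uses the 6.1 Mg ha⁻¹ KBS corn yield quoted earlier in the Methods.

```python
ETHANOL_LHV_MJ_PER_L = 21.1    # lower heating value quoted in the text
BIODIESEL_LHV_MJ_PER_L = 34.5

# Conversion yields quoted in the text, litres of fuel per kg of dry feedstock.
L_PER_KG = {"grain_ethanol": 0.43, "cellulosic_ethanol": 0.38, "soy_biodiesel": 0.21}


def ethanol_energy_gj_per_ha(dry_yield_kg_per_ha, litres_per_kg):
    """Fuel energy (GJ per hectare) from a dry feedstock yield (kg per hectare)."""
    litres = dry_yield_kg_per_ha * litres_per_kg
    return litres * ETHANOL_LHV_MJ_PER_L / 1000.0


# Example: corn grain at 6.1 Mg per hectare converted to ethanol.
corn_energy = ethanol_energy_gj_per_ha(6100.0, L_PER_KG["grain_ethanol"])
```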

To compare the sustainability of biofuel production between different scenarios,
we estimated the GHG emission intensity (gCO₂e MJ⁻¹), defined here as the net
CO₂e balance per unit of biofuel energy produced in the system (Supplementary
Information, equations (7)-(8)).
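
Taking the definition above at face value, emission intensity is the net balance divided by the energy produced, after bringing both to a per-hectare basis. The sketch below assumes exactly that (the supplementary equations are not reproduced here) and uses the no-till CSW and alfalfa values from Table 2 as inputs.

```python
M2_PER_HA = 10_000.0
MJ_PER_GJ = 1_000.0


def ghg_intensity(net_balance_g_m2_yr, energy_gj_ha_yr):
    """Net GHG balance (gCO2e per m2 per yr) per unit energy (gCO2e per MJ)."""
    g_per_ha = net_balance_g_m2_yr * M2_PER_HA
    mj_per_ha = energy_gj_ha_yr * MJ_PER_GJ
    return g_per_ha / mj_per_ha


no_till_csw = ghg_intensity(-397.0, 41.0)  # Table 2 reports -97
alfalfa = ghg_intensity(-605.0, 51.0)      # Table 2 reports -119
```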

Model simulations of plant biomass yields at regional scales. We used the EPIC
model to simulate plant biomass yields on regional scales (for site-specific
simulations and sensitivity analysis, see Supplementary Methods and
Supplementary Figs 4 and 5). EPIC is a comprehensive biophysical process model
capable of simulating plant yield and net primary productivity for diverse types
of vegetation using weather, terrain, soil and management data. To simulate
plant growth, EPIC uses radiation-use efficiency to calculate the potential daily
photosynthetic biomass production. Calculated daily potential growth is
decreased proportionately by the most severe stress due to non-optimal conditions
of water, nutrients (N and P), air and soil temperatures, soil acidity and soil
salinity. EPIC has undergone comprehensive validation for a wide range of soils,
management practices and climates in the United States and across the world.
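
The growth rule described above (potential growth from radiation-use efficiency, scaled by the single most limiting stress) can be illustrated with a toy sketch. This is not EPIC code: the function names, the stress-factor convention (1 means no stress, 0 means complete limitation) and the example numbers are all assumptions made for illustration.

```python
def potential_growth(intercepted_par_mj_m2, rue_g_per_mj):
    """Potential daily biomass gain (g per m2) from intercepted radiation."""
    return intercepted_par_mj_m2 * rue_g_per_mj


def actual_growth(potential, stress_factors):
    """Scale potential growth by the most severe (smallest) stress factor."""
    limiting = min(stress_factors.values())
    return potential * limiting


# Hypothetical day: nitrogen is the most limiting factor.
daily = actual_growth(
    potential_growth(5.0, 2.0),
    {"water": 0.8, "nitrogen": 0.5, "temperature": 0.9},
)
```

Using the minimum rather than the product of the factors reflects the statement that growth is reduced by the most severe stress, not by all stresses jointly.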
Regional EPIC simulations. We implemented an EPIC-based, spatially explicit
integrative modelling framework to simulate yields of perennial species grown
on marginal lands across the ten-state study area in the US north-central region
(Supplementary Fig. 6). This region extends from North Dakota in the northwest
corner to Nebraska in the southwest corner, Ohio in the southeast corner, and
Michigan in the northeast corner. The Canadian-US border defines the northern
border of the study region. A spatially explicit geospatial database containing soil,
terrain, weather, land-use/land-cover and management data was used to obtain
relevant parameters for running the EPIC model. Details of the geospatial database
follow.

Soils. Soil Survey Geographic data from the US Department of Agriculture 
Geospatial Data Gateway (http://datagateway.nrcs.usda.gov) were used to create 
a conterminous-scale soil property database at ~30-m scale. Properties included 
the number of soil layers; layer depth; slope gradient and length; bulk density; pH; 
per cent sand, silt, clay and coarse fragments; and per cent organic carbon and total 
nitrogen. To define marginal lands, we also extracted the land capability class 
(LCC) variable, which describes land classes on the basis of use limitations such
as erosion risk, soil depth, wetness and slope. There are eight LCCs, ranging from
class I (lands without any limitations for agricultural use) to class VIII (lands with 
severe limitations). Classes I-IV can support cropland agriculture, whereas classes 
V-VIII contain non-arable land. 

Topography. To derive topographical information, we used data from NASA’s 
Shuttle Radar Topography Mission, which produced a digital elevation model for 
the region at a resolution of 30 m (ref. 53). Geospatial analysis was used to derive 
slope length and gradient for each spatial modelling unit. 

Climate data. EPIC requires daily weather information including daily
temperature (maximum and minimum), precipitation, solar radiation, wind speed and
relative humidity. The North America Regional Reanalysis database
(http://www.esrl.noaa.gov/psd/data/gridded/data.narr.html) was used to derive daily
weather files at 32-km resolution.

Land use and land cover. The US National Agricultural Statistical Service’s
cropland data layer for 2008 and the Soil Survey Geographic database (ref. 29,
accessed 26 August 2011) were combined to define land use, land cover and soil
type at a spatial resolution of 60 m for the simulation domain. We excluded
vegetation that has been planted in recreational areas such as parks or golf courses, 
vegetation for erosion control in developed areas and vegetation on federal lands. 




Marginal lands. Marginal lands were identified as rural lands falling into LCCs 
V-VII with slope gradients of <20% (Supplementary Table 12) under non- 
forested vegetation. Available data sets do not allow us to distinguish between 
marginal lands readily available for biofuel feedstocks production and areas cur- 
rently grazed. We use the cropland data layer estimate for grazed land to provide a 
maximum worst-case estimate, recognizing that only a portion of this estimate will 
fall into LCCs V-VII with slope gradients of <20%. 
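
The selection rule for marginal lands can be written as a simple raster mask. This sketch assumes per-cell arrays of land capability class (coded 1 to 8), slope in per cent and a forest flag; the array names and the sample cells are hypothetical, and the "rural" criterion is not modelled here.

```python
import numpy as np


def marginal_mask(lcc, slope_pct, forested):
    """True where a cell is LCC V-VII, has slope < 20% and is not forested."""
    lcc = np.asarray(lcc)
    slope_pct = np.asarray(slope_pct, dtype=float)
    forested = np.asarray(forested, dtype=bool)
    return (lcc >= 5) & (lcc <= 7) & (slope_pct < 20.0) & ~forested


# Six hypothetical cells: only the second and the last satisfy all criteria.
mask = marginal_mask(
    lcc=[3, 5, 6, 7, 8, 6],
    slope_pct=[5, 10, 25, 15, 5, 10],
    forested=[False, False, False, True, False, False],
)
```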

We gave special consideration to the Sandhills region of Nebraska, where
unique grass-stabilized sand dune topography distinguishes the dunes from the
surrounding prairies. The sand dunes can be 100 m tall and several kilometres
long. To exclude the highly erodible dune ridges and slopes from the analysis,
we used the topographic position index algorithm in ArcGIS. This index is a
classification scheme based on the difference in elevation value between a cell in a
digital elevation model raster and its neighbours. The extent to which a cell is
higher or lower than its neighbours, combined with its slope, can be used to assign
it a landform classification such as valley, slope or ridge. Removing the dune
ridges and slopes from the analysis reduced the area available for consideration
as marginal lands by more than 200,000 ha. The interdune valleys falling into
LCCs V-VI and with slope gradients of <20% were then used for further analysis.
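
The idea behind the topographic position index can be illustrated with a minimal sketch: each cell's elevation minus the mean elevation of its neighbours. The 3 × 3 neighbourhood used here is an assumption chosen for brevity and is not a description of the ArcGIS implementation or of the window size used in the study.

```python
import numpy as np


def topographic_position_index(dem):
    """Elevation of each interior cell minus the mean of its 8 neighbours.

    Border cells, which lack a full neighbourhood, are returned as NaN.
    """
    dem = np.asarray(dem, dtype=float)
    rows, cols = dem.shape
    tpi = np.full(dem.shape, np.nan)
    # Sum over the 3x3 window (centre included) for every interior cell.
    window_sum = sum(
        dem[1 + di:rows - 1 + di, 1 + dj:cols - 1 + dj]
        for di in (-1, 0, 1)
        for dj in (-1, 0, 1)
    )
    centre = dem[1:-1, 1:-1]
    tpi[1:-1, 1:-1] = centre - (window_sum - centre) / 8.0
    return tpi


# A single 8-m bump on flat ground: positive TPI marks a ridge-like cell.
ridge = topographic_position_index([[0, 0, 0], [0, 8, 0], [0, 0, 0]])
```

Positive values indicate cells higher than their surroundings (ridge-like), negative values indicate valley-like cells, and values near zero indicate flat ground or mid-slope, which is why slope is needed as a second criterion.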
EPIC simulations and identification of biorefinery locations. For each grid of
marginal land, we simulated above-ground net primary productivity using the
calibrated EPIC model with three levels of nitrogen fertilization: 0, 68 and
123 kgN ha⁻¹ yr⁻¹. Modelled field size for the estimation of potential productivity
varied substantially, with a minimum size of 0.36 ha and a maximum of
>650 × 10³ ha. Overall, we simulated 78,184 fields having unique combinations
of soil type, land use and LCC. The connectivity between each field and nearby
fields was not assessed. The maximum potential overlap of our defined marginal
lands with grazing lands within each potential biorefinery circle ranges from 0 to
71%, with a regional maximum average of ~8% (Supplementary Table 3).

The values of above-ground net primary productivity obtained from EPIC
(Supplementary Tables 2 and 3) were used to identify the location of potential
biorefineries that could process the cellulosic feedstocks into ethanol
(Supplementary Information, equation (1)). We limited potential biorefineries to
areas where potential feedstock production in fertilized (68 kgN ha⁻¹ yr⁻¹)
simulations was at least 653 Gg yr⁻¹ of cellulosic biomass on marginal lands
within an economically feasible transportation distance of 80 km (ref. 58). For this,
we used a moving-window algorithm over the ten-state US midwest region and
aggregated biomass yields from marginal lands falling within an 80-km-radius
circle. Subsequently, non-overlapping circles with the highest biomass yields were
selected for potential biorefinery locations. Actual locations of circles are partly
dependent on geographic starting points for the moving-window analysis.
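
One plausible reading of the selection step is a greedy pass over window totals, sketched below. The greedy ordering, the planar distance test and the candidate data are assumptions for illustration; the text states only that non-overlapping circles with the highest biomass yields were selected.

```python
import math

RADIUS_KM = 80.0          # collection radius from the text
MIN_FEEDSTOCK_GG = 653.0  # minimum feedstock requirement from the text


def select_sites(candidates):
    """Pick non-overlapping circles, richest first.

    candidates: list of (x_km, y_km, biomass_gg) moving-window totals.
    Two circles overlap when their centres are closer than two radii.
    """
    chosen = []
    for x, y, biomass in sorted(candidates, key=lambda c: c[2], reverse=True):
        if biomass < MIN_FEEDSTOCK_GG:
            break  # all remaining candidates are below the minimum
        if all(math.hypot(x - cx, y - cy) >= 2 * RADIUS_KM for cx, cy, _ in chosen):
            chosen.append((x, y, biomass))
    return chosen


# Hypothetical window totals along a line of candidate centres.
sites = select_sites([(0, 0, 900), (100, 0, 800), (200, 0, 700), (400, 0, 500)])
```

Because a greedy pass depends on which candidates exist and on tie-breaking, the outcome is sensitive to the grid of window centres, consistent with the remark that circle locations depend partly on the starting points of the moving-window analysis.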
Stable creeping fault segments can become
destructive as a result of dynamic weakening


Hiroyuki Noda & Nadia Lapusta


Faults in Earth’s crust accommodate slow relative motion between
tectonic plates through either similarly slow slip or fast,
seismic-wave-producing rupture events perceived as earthquakes. These
types of behaviour are often assumed to be separated in space and
to occur on two different types of fault segment: one with stable,
rate-strengthening friction and the other with rate-weakening friction
that leads to stick-slip. The 2011 Tohoku-Oki earthquake
with moment magnitude Mw = 9.0 challenged such assumptions
by accumulating its largest seismic slip in the area that had been
assumed to be creeping. Here we propose a model in which
stable, rate-strengthening behaviour at low slip rates is combined
with coseismic weakening due to rapid shear heating of pore
fluids, allowing unstable slip to occur in segments that can
creep between events. The model parameters are based on laboratory
measurements on samples from the fault of the Mw 7.6 1999
Chi-Chi earthquake. The long-term slip behaviour of the model,
which we examine using a unique numerical approach that includes
all wave effects, reproduces and explains a number of both
long-term and coseismic observations (some of them seemingly
contradictory) about the faults at which the Tohoku-Oki and
Chi-Chi earthquakes occurred, including there being more
high-frequency radiation from areas of lower slip, the largest seismic
slip in the Tohoku-Oki earthquake having occurred in a potentially
creeping segment, the overall pattern of previous events in the
area and the complexity of the Tohoku-Oki rupture. The implication
that earthquake rupture may break through large portions of
creeping segments, which are at present considered to be barriers,
requires a re-evaluation of seismic hazard in many areas.

Seismic and geodetic observations have revealed that faults in
Earth’s crust can accommodate slow tectonic motion in different ways.
Some fault areas respond with stable, quasi-static motion, with slip
rates comparable to tectonic rates of millimetres to tens of millimetres
per year; other fault regions remain locked for years and then
experience fast dynamic ruptures that we perceive as earthquakes, with
average slip rates of the order of 1 m s⁻¹ (refs 1-3). One way to account
for such observations is to separate fault areas into two types (stable
areas, which mostly creep, and unstable areas, which produce
earthquakes), perhaps with some complexity related to their interaction.
As a consequence, large creeping segments may be regarded as
intrinsically ‘stable’ or ‘decoupled’, and, hence, as barriers to seismic rupture.

The 2011 M, 9.0 Tohoku-Oki earthquake challenged such assump- 
tions by accumulating its exceptionally large seismic slip, of as much as 
~50 m, in the shallower area of the megathrust, which had been con- 
sidered creeping and, hence, stable® "° (Fig. 1a). This unanticipated slip 
not only surprised the earthquake science community but resulted in 
the unexpectedly large magnitude of the earthquake and the extremely 
high tsunami that followed®. Previous large (Mw 7-8) events recorded 
in the area had occurred deeper on the megathrust*®. The geodetic data 
cannot resolve whether the shallower area of large coseismic slip had 
been creeping or not”!®. Another puzzling observation from the 
Tohoku-Oki earthquake is the fact that the lower-slip area produced 
more high-frequency radiation®*’. A similar observation has been 
previously made for the 1999 Mw 7.6 Chi-Chi earthquake’’” (Fig. 1b). 

The earthquake source model that we propose here—in which creep- 
ing segments can generate large seismic slips owing to rapid shear 
heating of pore fluids and the resulting coseismic weakening—can qua- 
litatively explain these and other observations within a single physical 
framework. Our model includes two aspects of earthquake source phy- 
sics that have been gaining acceptance and validation through laboratory 
experiments and comparison of earthquake models with observations. 

The first is the ‘rate-and-state’ nature of fault friction at low, aseis- 
mic slip rates, which has been conclusively documented in laboratory 
experiments and used to reproduce, both qualitatively and quantita- 
tively, a number of earthquake-source observations’’”*. In the rate- 
and-state friction laws (Methods), the steady-state frictional strength, 
τ_ss, depends on the slip rate, V, and the effective normal stress, σ − p, 
and is given by 

\[
\tau_{ss}(V) = (\sigma - p)\left[f_0 + (a - b)\ln(V/V_0)\right] \qquad (1)
\]

Here σ is the fault normal stress, p is the pore pressure of fluids that 
permeate the fault, f_0 is the steady-state friction coefficient at a reference 
slip rate V_0 and a − b is the rate-and-state parameter, which can be used 
to model both stable, rate-strengthening fault segments (a − b > 0) and 
potentially seismic, rate-weakening fault segments (a − b < 0). 
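
The role of the sign of a − b in equation (1) can be illustrated with a minimal Python sketch. The function name and all numerical values below are illustrative assumptions chosen for this example; they are not parameters reported in the paper.

```python
import math

def steady_state_strength(V, sigma, p, f0, a, b, V0):
    """Steady-state shear strength of equation (1), in the units of sigma and p."""
    return (sigma - p) * (f0 + (a - b) * math.log(V / V0))

# Illustrative values only; they are assumptions, not parameters from the paper.
SIGMA = 60e6   # fault normal stress, Pa
P = 30e6       # pore pressure, Pa
F0 = 0.6       # friction coefficient at the reference slip rate
V0 = 1e-6      # reference slip rate, m/s

for label, a, b in [("rate-strengthening (a - b > 0)", 0.015, 0.010),
                    ("rate-weakening (a - b < 0)", 0.010, 0.015)]:
    slow = steady_state_strength(1e-9, SIGMA, P, F0, a, b, V0)
    fast = steady_state_strength(1e-3, SIGMA, P, F0, a, b, V0)
    # Strength rises with slip rate only when a - b is positive.
    print(label, "-> strength rises with slip rate:", fast > slow)
```

In this sketch, raising the pore pressure p lowers the strength at every slip rate, which is the route by which thermal pressurization weakens the fault in the model.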

The second emerging property of the earthquake source is addi- 
tional substantial fault weakening at seismic slip rates. Although 
theories of such weakening have a long history’’, relatively recent 
laboratory confirmations of this phenomenon’ have lent further 
credence to this notion in earthquake science. One of the physical 
mechanisms for coseismic weakening is the temperature increase 
due to rapid shear heating, which causes thermal expansion of pore 
fluids'?’°. If the rock surrounding the fault shear zone has sufficiently 
low permeability, the resulting increase in p lowers σ − p and, hence, 
the friction resistance in equation (1). Such thermal pressurization has 
been suggested as an important source mechanism for the 1999 Chi- 
Chi earthquake’*”’. Several other mechanisms of coseismic weakening 
have been proposed and used to simulate fault-slip phenomena, 
including in the Tohoku-Oki area’***”. 

Our fault model (Fig. 1c, d) incorporates a patch that combines 
stable, rate-strengthening properties at low slip rates with substantial 
coseismic weakening due to thermal pressurization of pore fluids. Such 
a set of properties is motivated by laboratory measurements” on 
samples from the Chelungpu fault, Taiwan, which hosted the 1999 
Mw 7.6 Chi-Chi earthquake, one of the most well-studied seismic 
events'”!°°. Chi-Chi’s fault rupture nucleated in the southern region 
of the Chelungpu fault and propagated mainly to the north, where 
it accumulated its largest coseismic slip, of about 8 m (Fig. 1b). 
However, the southern region, of substantially lower slip, generated 
more high-frequency radiation’’, just as observed for the Tohoku- 
Oki earthquake”. Laboratory measurements’ on rock samples from 
two shallow boreholes that penetrated the Chelungpu fault indicate 
that, at low slip rates, the southern fault region has earthquake-prone 
rate-weakening friction (a − b < 0) whereas the northern region has 
stable, rate-strengthening friction (a − b > 0). However, the rock 
permeability is lower in the north by about two orders of magnitude’, 
indicating that the northern part of the fault is more susceptible to 
coseismic weakening by thermal pressurization of pore fluids. Motivated 
by these measurements, our model considers the interaction of 
two patches, A and B, with the corresponding laboratory-measured 
properties (Fig. 1, section 1 of Supplementary Information, 
Supplementary Fig. 1 and Supplementary Table 1). To the best of our 
knowledge, this is the first study of long-term fault behaviour motivated 
by laboratory studies of materials from an actual fault zone. 

Figure 1 | Schematics of the fault model and its correspondence to great 
earthquakes. a, b, Slip distributions in the 2011 Tohoku-Oki earthquake 
(a; adapted from ref. 8) and the 1999 Chi-Chi earthquake (b; adapted from ref. 
20). They featured a higher-slip patch producing less high-frequency radiation 
than the neighbouring lower-slip patch. The two squares outline the patches of 
our model. c, Fault model motivated by experimental studies’’ for the 

We study the implications of this rich fault rheology in a simplified 
model of a planar fault embedded into a homogeneous, elastic space. 
This set-up allows us to investigate the qualitative consequences of the 
rheology without additional complexity associated with fault geometry 
and bulk properties. It also enables us to use a unique and highly 
efficient computational methodology'*’* for simulation of earthquake 
sequences and slow slip, which incorporates both slow tectonic loading 
and all inertial (wave) effects of dynamic ruptures but is at present 
restricted to relatively simple fault geometries. The numerical 
approach solves for the time evolution of fault slip and stress governed 
by a highly nonlinear coupled system of elastodynamic equations 
describing stress interactions, and includes rate-and-state friction on 
the fault that depends, in part, on the evolving slip rate and pore 
pressure, and explicit calculation of temperature and pore pressure 
evolution on the fault that incorporates fault-normal diffusion of 
heat and pore fluids (Methods). On the fault, the two interacting 
patches are surrounded by a rate-strengthening region with no coseis- 
mic weakening, which, in turn, is surrounded by areas steadily moving 
at a representative plate rate of 10⁻⁹ m s⁻¹. 

The resulting model response in terms of earthquake sequences and 
slow slip qualitatively reproduces a range of observations for both the 
Chi-Chi and the Tohoku-Oki earthquake on scales ranging from that 
of a single seismic event to several earthquake cycles (Figs 2 and 3). 

Chelungpu fault, which hosted the Chi-Chi earthquake. At aseismic slip rates, 
patch A has instability-prone, rate-weakening friction whereas patch B has 
stable, rate-strengthening friction. However, patch B is more susceptible to 
coseismic weakening. d, Schematic plots of how shear resistance in the patches 
depends on their respective slip rates. 

The produced earthquake sequences consist of two types of event: 
smaller, more-frequent events that rupture only patch A, and larger, 
model-spanning events that spread over both patches (Fig. 2a). All 
seismic events initiate in patch A, which is susceptible to earthquake 
nucleation owing to its rate-weakening properties at low slip rates, and 
many of them arrest without rupturing patch B. However, occasional 
events manage to penetrate into patch B and activate coseismic weak- 
ening there, essentially changing the patch from being (interseis- 
mically) stable to (coseismically) unstable. In such model-spanning 
events, the slip is largest in patch B, owing to larger pore-pressure- 
induced weakening and the associated larger stress drop (Supplemen- 
tary Fig. 2a). Whether dynamic rupture is successful in activating 
coseismic weakening in patch B depends on a number of factors 
(Supplementary Information, section 3). 

Such an overall pattern of seismic slip is consistent with observa- 
tions and inferences for the areas of both the Tohoku-Oki and the 
Chi-Chi earthquake. In the Tohoku subduction zone, many Mw 7-8 
events have occurred on the deeper part of the subduction interface, 
which experienced lower slip in the Tohoku-Oki earthquake* (Fig. 1a). 
Similarly, a series of trench surveys along the Chelungpu fault has 
revealed that earthquakes occur more often in the southern part of 
the Chi-Chi earthquake source region, which experienced less slip, 
than in the northern part*®. The model dimensions are tailored to 
the size of the Chi-Chi source, and many model-spanning events 
indeed have peak slip comparable to the 8 m observed in the northern 
part of the Chi-Chi rupture. 

Patch B, which combines laboratory-measured properties that allow 
for both interseismic creep and rapid coseismic slip, produces a rich 
variety of behaviours in a single model (Fig. 2), from creeping with the 
long-term convergence rate (that is, being fully decoupled), to occa- 
sionally participating in large seismic events, to being locked (that is, 
fully coupled) (section 2 of Supplementary Information, Supplemen- 
tary Figs 2b and 3, and Supplementary Videos 1 and 2). Such behaviour 
provides a new physical interpretation of the conceptual notion of 
‘conditionally stable’ fault behaviour’. Importantly, immediately 
before a seismic event manages to propagate through it, patch B can 
be creeping (decoupled), in contrast to the notion that large earth- 
quakes rupture mostly locked patches or asperities**. This richness of 
behaviour stands in contrast to patch A, which has a simpler 
stick-slip response. 

Figure 2 | Long-term behaviour of the model in terms of earthquake 
sequences and aseismic slip. a, Cumulative slip distribution along the fault at 
mid depth. Earthquakes are shown by the solid lines, which are plotted with a 
time step of 1 s. Dashed lines represent slow slip every 50 yr, with the numbers 
indicating the simulated time in years. Patch A experiences stick-slip. Patch B 

In addition to capturing the long-term behaviour, our model simul- 
taneously reproduces the coseismic observations on less-pronounced 
high-frequency radiation from the area of the largest slip in both the 
Chi-Chi and the Tohoku-Oki earthquake”””". Patch B has higher slip 
but a smoother slip-rate history than does patch A (Fig. 3a, b), as a 
result of both more-gradual weakening through pore pressurization 
and the velocity-strengthening nature of the initial response at the 
rupture front. This is reflected in the power spectrum of the source 
time functions (Fig. 3c): in patch B, the low-frequency content is larger 
but the spectrum decays more rapidly. 
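
The link between a smoother slip-rate history and a faster-decaying spectrum can be illustrated with a small, self-contained Python sketch. The two pulses below are hypothetical stand-ins (a large gradual pulse and a small abrupt one), not the simulated source time functions of the model, and the cut-off index is an arbitrary assumption.

```python
import cmath
import math

def power_spectrum(signal):
    """Power of the discrete Fourier transform for non-negative frequencies."""
    n = len(signal)
    spectrum = []
    for k in range(n // 2 + 1):
        s = sum(signal[j] * cmath.exp(-2j * math.pi * k * j / n) for j in range(n))
        spectrum.append(abs(s) ** 2)
    return spectrum

def high_frequency_fraction(signal, k_cut):
    """Fraction of total spectral power at frequency indices k_cut and above."""
    spectrum = power_spectrum(signal)
    return sum(spectrum[k_cut:]) / sum(spectrum)

# Hypothetical slip-rate histories sampled every 0.1 s (assumed shapes).
n, dt = 256, 0.1
t = [i * dt for i in range(n)]
smooth = [2.0 * math.exp(-((ti - 12.0) / 3.0) ** 2) for ti in t]          # large slip, gradual
sharp = [0.0 if ti < 5.0 else math.exp(-(ti - 5.0)) for ti in t]          # small slip, abrupt onset

print("zero-frequency power, smooth vs sharp:",
      power_spectrum(smooth)[0], power_spectrum(sharp)[0])
print("high-frequency fraction, smooth vs sharp:",
      high_frequency_fraction(smooth, 20), high_frequency_fraction(sharp, 20))
```

The gradual pulse carries more low-frequency power because it integrates to a larger slip, while the abrupt pulse retains a much larger share of its power at high frequencies, which mirrors the qualitative contrast described for patches B and A.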

The interaction of the two patches can also produce the complex 
rupture pattern (Fig. 3a and Supplementary Video 1, from 59 s to 
1 min 2 s) inferred for the Tohoku-Oki earthquake’. After nucleating 
near the middle of the fault, the rupture first propagates mostly to 
the left, into patch A. As thermally induced weakening of patch B 
progresses, the rupture there accelerates and accumulates larger slip, 
creating stress concentration over patch A and causing re-rupturing of 
patch A. This re-rupturing would be more prominent in models 
with higher slip contrast between the two patches. The re-rupturing 
feature is also present in simulations with patches of identical frictional 
properties but different permeabilities; however, the behaviour of that 
model differs in other important respects (section 4 of Supplementary 
Information, Supplementary Figs 4 and 5, and Supplementary Table 1). 

can both creep and participate in earthquakes, accumulating large seismic slip. 
b, Evolution of fault coupling (Methods Summary), with locked segments 
corresponding to a coupling of 1. White lines indicate the extent of earthquake 
ruptures. Note that patch B can be either creeping or locked before model- 
spanning seismic events. 

Our study shows that both long-term and coseismic observations 
from the areas of the Tohoku-Oki and Chi-Chi earthquakes can be 
explained and reconciled by the presence of fault patches that are 
stable at low slip rates but can experience shear-induced coseismic 
weakening. Our results demonstrate how fluids can cause stable 
fault behaviour to become destructive through thermal fluid pressu- 
rization during rapid slips. In fact, experimental studies have shown 
that wet, clay-rich sediments, which are likely to be present in shallow 
portions of subduction zones, are indeed susceptible to thermal 
pressurization’’. Other coseismic weakening processes'***-** may have 
similar effects. Our findings add to the accumulating evidence that 
earthquake source processes may be controlled by the interaction 
between rate- and state-dependent friction properties and effects of 
fluids that permeate Earth’s crust; for example, fluid effects related to 
dilatancy have been shown to facilitate slow slip events*®. Combina- 
tion of another coseismic weakening formulation with a mix of rate- 
strengthening or rate-weakening low-slip-rate properties has been 
shown to match the long-term slip pattern in a geometrically realistic 
model of the Tohoku-Oki area*’; however, coseismic properties of the 
resulting ruptures have not been explored and would be difficult to interpret 
owing to the quasi-dynamic approximation used in those studies. 

Figure 3 | Coseismic behaviour of a model-spanning event. The event is that 
shown in yellow in Fig. 2a. a, Snapshots of slip-rate distribution every 1 s along 
the fault at mid depth, displaced vertically for clarity. The event nucleates near 
the middle of the fault, ruptures patch A, ruptures patch B with larger slip and 
then re-ruptures patch A, as inferred for the Tohoku-Oki earthquake’. c_s, 
shear-wave speed. b, Slip-rate histories at representative points. The history of 
patch B is smoother than that of patch A, owing to initial rate-strengthening 
behaviour and subsequent more gradual weakening. c, Frequency structure of 
the slip-rate histories. Patch B produces larger slip but less high-frequency 
content, as observed in the Tohoku-Oki and Chi-Chi earthquakes”. 

Findings such as these have important implications for seismic 
hazard, because they suggest that currently creeping (decoupled) fault 
regions, which are thought to be stable and aseismic, may participate in 
destructive events and host large seismic slip. After large events, such 
regions would stay locked for a while, but would eventually accumulate 
enough stress to start creeping again, obscuring the evidence of their 
destructive past. Hence, using only the relatively short recorded seis- 
mic and geodetic history to estimate seismic hazard is inadequate, as 
demonstrated by the unexpected 2011 Tohoku-Oki earthquake. For 
example, on the basis of fewer than 100 years of recorded history”, 
the creeping section of the San Andreas fault in California is thought 
to provide a barrier between the southern and northern locked parts 
of the fault, essentially preventing a massive earthquake that would 
involve all of California from San Diego and Los Angeles to San 
Francisco. However, our study provides a plausible physical mecha- 
nism by which that segment could join such a massive event. The 
possibility that dynamic rupture may penetrate into regions stable at 
low slip rates is important for the down-dip limit of large earthquakes. 
To understand whether a creeping segment will participate in seis- 
mic events, it is important to assess whether it is susceptible to coseis- 
mic weakening. This can be achieved through palaeoseismic studies 
that look for evidence of past rapid slip*°. However, such studies 
are not feasible in many places. This highlights the importance of 
materials-science-based models, such as the one presented here, which 
can incorporate laboratory findings and explore a full range of poten- 
tial fault behaviours. To be not only qualitatively but also quantita- 
tively useful, the models need to be refined on the basis of more 
experimental studies on physical fault properties under realistic 
depth-dependent temperature-pressure conditions. 


METHODS SUMMARY 


Our simulation approach produces a sequence of spontaneous earthquakes and 
aseismic slip in a fault model, accounting for both inertial effects and long-term 
tectonic loading'*'*. In the model, a planar fault is embedded in a linear, elastic, 
infinite medium. Part of the fault is governed by rate- and state-dependent 
friction’' coupled with fluctuation in the pore fluid pressure due to frictional heating 
and diffusion normal to the fault’? "”. The surrounding fault areas move with the 
assigned plate convergence rate, providing tectonic-like loading for the fault region 
of interest. The coupled problem of the elastodynamic equations in the bulk and the 
friction boundary condition on the fault is solved by a three-dimensional spectral 
boundary integral equation method that uses an adaptive time-stepping procedure’®’®. 
The fault-normal diffusion equations for temperature and pore pressure are 
solved by an unconditionally stable spectral method’*. To investigate the combined 
effect of rate strengthening at low slip rates and efficient dynamic weakening at 
coseismic slip rates, we create a model with two fault patches of different properties 
(Fig. 1c). The seismogenic region that includes the two patches is comparable in size 
to the source region of the Mw 7.6 1999 Chi-Chi earthquake’””®. The two patches 
have laboratory-measured frictional and hydraulic properties'? based on samples 
collected from boreholes penetrating the northern and southern part of the 
Chelungpu fault, Taiwan, which hosted the Chi-Chi earthquake. Specific values 
of the parameters used are given in Supplementary Information. The fault coupling 
plotted in Fig. 2b is defined as 1 − V/V_pl when V ≤ V_pl and as 0 when V > V_pl, 
where V is the slip rate and V_pl is the long-term plate rate. 
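
The coupling measure defined above is simple enough to state as a short Python sketch. The function name and the plate-rate value used in the example are assumptions made for illustration only.

```python
def fault_coupling(slip_rate, plate_rate):
    """Coupling as defined in the text: 1 - V/V_plate for V <= V_plate, else 0."""
    if slip_rate > plate_rate:
        return 0.0
    return 1.0 - slip_rate / plate_rate

# Illustrative plate rate in m/s (an assumed value for this example).
V_PLATE = 1e-9

print(fault_coupling(0.0, V_PLATE))            # locked patch: coupling of 1
print(fault_coupling(V_PLATE, V_PLATE))        # creeping at the plate rate: coupling of 0
print(fault_coupling(1.0, V_PLATE))            # seismic slip rate: coupling of 0
```

A patch slipping at half the plate rate has a coupling of 0.5 under this definition, so intermediate values in Fig. 2b correspond to partial creep.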


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Our model is mathematically represented by a highly nonlinear coupled system of 
elastodynamic equations describing stress interactions, and includes rate-and- 
state friction on the fault that depends, in part, on the evolving slip rate and pore 
pressure, and explicit calculation of temperature and pore pressure evolution on 
the fault that incorporates fault-normal diffusion of heat and pore fluids. Here we 
summarize the set of partial differential equations solved and the distributions 
of parameters assumed in the present study. For the solution techniques and 
numerical implementation, see ref. 16. 

The elastodynamics of a three-dimensional infinite body is treated by a spectral 
boundary integral equation method'*'*. Shear traction on a planar fault, τ, is 
expressed as 

\[
\tau(\mathbf{x},t) = \tau_0 + \phi[\mathbf{x},t;V] - \frac{\mu}{2c_s}\,V(\mathbf{x},t) \qquad (2)
\]

where x is the position vector spanning the fault, t is time, τ_0 is the shear traction 
that would act on the fault if it were prevented from slipping, V is the slip rate, μ is 
the shear modulus, c_s is the shear-wave speed and φ is the stress redistribution 
term, which depends on the previous spatiotemporal distribution of V. The spatial 
Fourier transform of φ, denoted by Φ, is expressed as 


\[
\Phi(\mathbf{k},t) = C_s(\mathbf{k})\,D(\mathbf{k},t) + \int_{t-t_w}^{t} C_p(\mathbf{k},t-t')\,\dot{D}(\mathbf{k},t')\,\mathrm{d}t' \qquad (3)
\]

where k is the wavenumber vector, C_s and C_p are matrices of functions'*, D and Ḋ 
are the spatial Fourier transforms of the slip and slip rate distributions, 
respectively, and t_w is the time window within which we consider dynamic 
propagation of elastic waves. We take this window to be long enough for the shear wave to 
propagate over the two patches. 

Frictional resistance of the fault is given by 


\[
\tau = (\sigma - p)\,f(V,\theta) \qquad (4)
\]

where σ is the elastodynamic normal stress, p is the pore pressure on the fault, 
σ − p is the effective normal stress and f is the friction coefficient, which depends 
on the magnitude of slip rate, V, and a state variable, θ, that characterizes the 
evolving properties of contacts in the shearing layer. For the friction law, we use a 
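regularized version”’ of the experimentally derived rate-and-state laws ** with 
the ageing state evolution equation 

\[
\begin{aligned}
f &= a\,\operatorname{arcsinh}\!\left[\frac{V}{2V_0}\exp\!\left(\frac{f_0 + b\log(V_0\theta/L)}{a}\right)\right]
\approx f_0 + a\log\!\left(\frac{V}{V_0}\right) + b\log\!\left(\frac{V_0\theta}{L}\right), \\
\frac{\mathrm{d}\theta}{\mathrm{d}t} &= 1 - \frac{V\theta}{L}
\end{aligned}
\qquad (5)
\]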


where f_0 is the steady-state friction coefficient at a reference slip rate V_0, and a, b 
and L are the parameters that quantify variations in friction due to varying values 
of the slip rate and state variable. Note that the state variable evolves with slip, so 
that the friction law (equation (5)) combines the dependence of friction on both 
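slip and slip rate. The steady-state value of friction, f_ss, that prevails at a constant 
slip rate is given by 

\[
f_{ss} = a\,\operatorname{arcsinh}\!\left[\frac{V}{2V_0}\exp\!\left(\frac{f_0 - b\log(V/V_0)}{a}\right)\right]
\approx f_0 + (a - b)\log\!\left(\frac{V}{V_0}\right)
\]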


By varying the sign of the parameter a — b, the laws can be used to describe both 
stable, rate-strengthening fault segments (a — b > 0) and potentially seismic, rate- 
weakening fault segments (a — b <0). 
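
How closely the regularized law tracks its logarithmic approximation can be checked numerically. The sketch below assumes the arcsinh form of the regularized friction coefficient and the steady-state value θ = L/V implied by the ageing law; the function names and parameter values are illustrative assumptions, not those of Supplementary Table 1.

```python
import math

def friction_regularized(V, theta, f0, a, b, V0, L):
    """Regularized rate-and-state friction coefficient (arcsinh form)."""
    arg = (V / (2.0 * V0)) * math.exp((f0 + b * math.log(V0 * theta / L)) / a)
    return a * math.asinh(arg)

def friction_logarithmic(V, theta, f0, a, b, V0, L):
    """Standard logarithmic form, undefined at V = 0."""
    return f0 + a * math.log(V / V0) + b * math.log(V0 * theta / L)

def steady_state_theta(V, L):
    """State variable at which d(theta)/dt = 1 - V*theta/L vanishes."""
    return L / V

# Illustrative parameters (assumed values for this example only).
F0, A, B, V0, L = 0.6, 0.010, 0.015, 1e-6, 1e-5

for V in (1e-9, 1e-6, 1e-3, 1.0):
    theta = steady_state_theta(V, L)
    f_reg = friction_regularized(V, theta, F0, A, B, V0, L)
    f_ss = F0 + (A - B) * math.log(V / V0)
    print(f"V = {V:g} m/s: regularized {f_reg:.6f}, f0 + (a - b) log(V/V0) {f_ss:.6f}")

# The regularized form stays finite as the slip rate goes to zero.
print(friction_regularized(0.0, 1.0, F0, A, B, V0, L))
```

With these assumed values the two forms agree to many decimal places at all tested slip rates, and only the regularized form remains defined at V = 0, which is the practical reason for using it in long-term simulations that include nearly locked fault areas.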

Motivated by the laboratory-measured properties of the rock samples from the 
Chelungpu fault’, we combine the rate-and-state formulation (equation (5)) with 
the coseismic weakening due to pore fluid pressurization. To that end, we explicitly 
track the evolution of temperature, T, due to frictional heating and the associated 
evolution in pore pressure, accounting for fault-normal diffusion of both heat and 
pore fluids'***: 


\[
\frac{\partial T(x,y,z,t)}{\partial t} = \alpha_{th}\,\frac{\partial^2 T(x,y,z,t)}{\partial y^2} + \frac{\omega(x,y,z,t)}{\rho c}
\]

\[
\frac{\partial p(x,y,z,t)}{\partial t} = \alpha_{hy}\,\frac{\partial^2 p(x,y,z,t)}{\partial y^2} + \Lambda\,\frac{\partial T(x,y,z,t)}{\partial t}
\]

Here y is the fault-normal direction, x and z parameterize the fault plane, ω(x, y, 
z, t) is the shear heating source caused by fault slip, α_th and α_hy are respectively the 
thermal and hydraulic diffusivities, ρc is the specific heat capacity and Λ is the pore 
pressure change per unit temperature change under undrained conditions. Note 
that permeability of near-fault materials affects the value of α_hy. In the simulated 
examples, it is assumed that V is accommodated by a fault layer of half-width w, 
with the resulting shear heating source in the form 

\[
\omega = \tau(x,z,t)\,V(x,z,t)\,\frac{\exp(-y^2/2w^2)}{\sqrt{2\pi}\,w}
\]


The resulting evolution of pore pressure is then coupled with frictional resistance, 
because pore pressure enters the friction formulation (equation (4)). 
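
A Gaussian heating source of this kind distributes the frictional work rate, shear traction times slip rate, across the shear zone without changing its total. The following Python sketch checks that property numerically; it assumes the Gaussian profile with half-width w described above, and the function names and input values are illustrative assumptions.

```python
import math

def shear_heating(y, tau, V, w):
    """Heating rate per unit volume at fault-normal distance y (Gaussian profile)."""
    return tau * V * math.exp(-y ** 2 / (2.0 * w ** 2)) / (math.sqrt(2.0 * math.pi) * w)

def integrate_across_fault(tau, V, w, n=2001, half_range=8.0):
    """Trapezoidal integral of the heating source over y in [-half_range*w, half_range*w]."""
    y_min = -half_range * w
    h = 2.0 * half_range * w / (n - 1)
    values = [shear_heating(y_min + i * h, tau, V, w) for i in range(n)]
    return h * (sum(values) - 0.5 * (values[0] + values[-1]))

# Illustrative inputs (assumed values): 20 MPa traction, 1 m/s slip rate, 1 mm half-width.
tau, V, w = 20e6, 1.0, 1e-3

print("integrated heating per unit fault area:", integrate_across_fault(tau, V, w))
print("frictional work rate tau * V:          ", tau * V)
```

Narrowing w concentrates the same heat input in a thinner layer, which raises the peak heating rate at y = 0 in inverse proportion to w.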

Because we are interested in fault phenomena on a wide range of timescales, 
from the weakening process at the rupture front (order of milliseconds) to the 
recurrence of large earthquakes (order of 100 yr), we need an accurate and stable 
numerical procedure for the integration of the state variable, θ, the temperature, T, 
and the pore pressure, p. We use an algorithm" in which the time integrations of θ 
and Fourier coefficients of T and p are explicit, unconditionally stable, second 
order in time and spectral in wavenumber, and that allows adaptively varying time 
steps on a logarithmic scale. We use the adaptive stepper developed in ref. 36. The 
details of numerical implementation are described in ref. 16. 

The geometry of the model and the physical parameters used in the simulation 
presented in the main article are shown in Supplementary Fig. 1 and 
Supplementary Table 1. The spatial distribution of the parameters is infinitely 
differentiable because the boundaries between regions with different properties 
are represented by a smoothed boxcar function 


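\[
B(x; W_1, W_2) =
\begin{cases}
1, & |x| \le W_1 \\
\dfrac{1}{2}\left[1 + \tanh\!\left(\dfrac{W_2 - W_1}{|x| - W_2} + \dfrac{W_2 - W_1}{|x| - W_1}\right)\right], & W_1 < |x| < W_2 \\
0, & W_2 \le |x|
\end{cases}
\]

where W_1 and W_2 are the half-lengths of the flat plateau and the support of the 
smoothed boxcar, respectively. The smoothed boxcar functions, which take the 
value of 1 inside patches A and B, are given respectively by 

\[
B_A(x,z) = B(x + (W_1 + W_2)/2;\, W_1, W_2)\,B(z;\, W_1, W_2)
\]

and 

\[
B_B(x,z) = B(x - (W_1 + W_2)/2;\, W_1, W_2)\,B(z;\, W_1, W_2)
\]

The non-uniform parameters are distributed as 

\[
\begin{aligned}
f_0 &= f_{0A} + (f_{0B} - f_{0A})\,B_B \\
b &= b_A B_A + b_B B_B \\
L &= 1\,\mathrm{m} + (L_{AB} - 1\,\mathrm{m})(B_A + B_B) \\
\log(\alpha_{hy}) &= \log(\alpha_{hyA}) + (\log(\alpha_{hyB}) - \log(\alpha_{hyA}))\,B_B \\
\Lambda &= \Lambda_A B_A + \Lambda_B B_B \\
w &= 1\,\mathrm{m} + (w_{AB} \ldots
\end{aligned}
\]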


where subscripts A and B indicate the values inside patches A and B, respectively. 
The parameter values in the patches are listed in Supplementary Table 1, which 
also gives the parameters uniform over the model. We choose the value of L to be 
large (1 m) far from the patches to confine the ruptures better. 
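
A smoothed boxcar with a flat plateau and compact support can be sketched in a few lines of Python. The transition used here, a tanh of the sum (W2 − W1)/(|x| − W2) + (W2 − W1)/(|x| − W1), is an assumed form consistent with the description of an infinitely differentiable taper between half-lengths W1 and W2; the function names and the half-lengths in the example are hypothetical.

```python
import math

def smoothed_boxcar(x, W1, W2):
    """1 on the plateau |x| <= W1, 0 outside |x| >= W2, smooth tanh taper in between."""
    ax = abs(x)
    if ax <= W1:
        return 1.0
    if ax >= W2:
        return 0.0
    return 0.5 * (1.0 + math.tanh((W2 - W1) / (ax - W2) + (W2 - W1) / (ax - W1)))

def patch_A(x, z, W1, W2):
    """Indicator-like weight for patch A, centred at x = -(W1 + W2)/2."""
    return smoothed_boxcar(x + 0.5 * (W1 + W2), W1, W2) * smoothed_boxcar(z, W1, W2)

def patch_B(x, z, W1, W2):
    """Indicator-like weight for patch B, centred at x = +(W1 + W2)/2."""
    return smoothed_boxcar(x - 0.5 * (W1 + W2), W1, W2) * smoothed_boxcar(z, W1, W2)

# Illustrative half-lengths in arbitrary units (assumed values).
W1, W2 = 1.0, 2.0

for x in (0.0, 1.0, 1.25, 1.5, 1.75, 2.0):
    print(x, smoothed_boxcar(x, W1, W2))
```

A spatially varying parameter can then be assembled as a weighted sum, for example a value b_A * patch_A(x, z, W1, W2) + b_B * patch_B(x, z, W1, W2), so that the property changes smoothly across patch boundaries rather than jumping.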

The fault coupling plotted in Fig. 2b is defined as 1 − V/V_pl when V ≤ V_pl and 
as 0 when V > V_pl, where V is the slip rate and V_pl is the long-term plate rate. 

The introduction of dairying was a critical step in early agriculture, 
with milk products being rapidly adopted as a major component 
of the diets of prehistoric farmers and pottery-using late hunter- 
gatherers’ °. The processing of milk, particularly the production of 
cheese, would have been a critical development because it not only 
allowed the preservation of milk products in a non-perishable and 
transportable form, but also it made milk a more digestible com- 
modity for early prehistoric farmers®”°. The finding of abundant 
milk residues in pottery vessels from seventh millennium sites 
from north-western Anatolia provided the earliest evidence of milk 
processing, although the exact practice could not be explicitly 
defined’. Notably, potsherds pierced with small 
holes appear at early Neolithic sites in temperate Europe in the 
sixth millennium BC and have been interpreted typologically as 
‘cheese-strainers’”, although a direct association with milk processing 
has not yet been demonstrated. Organic residues preserved in 
pottery vessels have provided direct evidence for early milk use in 
the Neolithic period in the Near East and south-eastern Europe, 
north Africa, Denmark and the British Isles, based on the δ¹³C and 
Δ¹³C values of the major fatty acids in milk'*. Here we apply the 
same approach to investigate the function of sieves/strainer vessels, 
providing direct chemical evidence for their use in milk processing. 
The presence of abundant milk fat in these specialized vessels, com- 
parable in form to modern cheese strainers", provides compelling 
evidence for the vessels having been used to separate fat-rich milk 
curds from the lactose-containing whey. This new evidence empha- 
sizes the importance of pottery vessels in processing dairy products, 
particularly in the manufacture of reduced-lactose milk products 
among lactose-intolerant prehistoric farming communities®”. 

The emergence of dairying was a major innovation in prehistoric 
societies, enabling the supply of nutritious food without the slaughtering 
of precious livestock"®. The processing of milk, particularly the produc- 
tion of cheese, would have been an important development; however, the 
origins of cheese making are currently unknown. Iconographic and 
written evidence from the mid-third-millennium BC weakly documents 
the history of cheese making’*”, although its origins probably lie much 
earlier in prehistory. 

The production of cheese is a technically complex process. It 
involves the coagulation of milk, either enzymatically or by acid treat- 
ment, yielding the semi-solid curds (a combination of the major milk 
nutrients—protein, mainly casein, and milk fat) and then removal of 
the water-soluble lactose by straining off the liquid whey. Today, the 
straining process is commonly achieved using a coarse textile, ‘cheesecloth’, 
or plastic or metal sieves as the curds are sufficiently coarse to be 
strained effectively through'**. 

Although straining of the curds through textile or even wicker con- 
tainers would have been possible in prehistoric times, fragments of 
pottery pierced with small holes (2-3 mm in diameter, Fig. 1) recovered 
from central European Linear Pottery (Linearbandkeramik) culture 
sites have attracted considerable attention. Critically, the Linear Pot- 
tery culture represents the first communities using cultivated plants and 
domestic animals in interior central Europe. By typological comparison 
with modern and ethnographic perforated vessels'’ (Supplementary 
Fig. 1), these sieve vessels have been interpreted as ‘cheese-strainers’'®”>. 
Indeed, the coexistence of such vessels with archaeozoological evidence 
for domestic ruminants led to the proposal of the emergence of milk 
exploitation at the beginnings of animal domestication, at least a mil- 
lennium earlier than Sherratt’s secondary products revolution hypo- 
thesis predicted’. However, other suggestions have been made 
concerning the use of sieve vessels, for example as flame covers, honey 
strainers or for beer making'*"*. Preliminary investigations of similar 


Figure 1 | Drawings of representative reconstructed sieve vessels and 
photographs of specific sieve fragments from the region of Kuyavia 
submitted to lipid residue analyses. a, b, KUY0750, from Brześć Kujawski site 
3. c, d, KUY0757 from Smolsk site 4. The typology of the sieve vessels is 
comparable to those used by modern-day cheese producers (Supplementary 
Fig. 1). Drawings used with permission from ref. 20. 
Table 1 | List of sherds selected for lipid residue analyses 

                                        Early     Classic   Late      Unknown   Total 
                                        phase     phase     phase 
Sieves from the region of Kuyavia 
  Ludwinowo 6                           -         2 (2)     -         -         2 (2) 
  Ludwinowo 7                           -         19 (12)   16 (8)    2 (1)     37 (21) 
  Brześć Kujawski 3                     -         2 (2)     -         -         2 (2) 
  Brześć Kujawski 4                     -         -         2 (2)     -         2 (2) 
  Miechowice 4                          1 (1)     2 (2)     -         -         3 (3) 
  Smolsk 4                              -         -         2 (2)     -         2 (2) 
  Wolica Nowa 1                         -         1 (1)     -         -         1 (1) 
  Stare Nakonowo 2                      -         1 (1)     -         -         1 (1) 
Non-perforated vessels from Ludwinowo 7 
  Cooking pots                          6         19        19        -         44 
  Bowls                                 1         2         4         -         7 
  Flasks                                2         6         7         -         15 

Each non-perforated sherd corresponds to a single vessel. For the sieves, the total number of individual 
sieves is indicated in parentheses. Early phase, 5,400/5,300-5,200 cal. BC; classic phase, 5,200-5,000 
cal. BC; and late phase, 5,100/5,000-4,900/4,800 cal. BC?°*?. 




strainer vessels from third millennium Harappan sites in Asia showed 
the presence of fatty acids, although these data alone are insufficient to 
ascribe an association with dairy product processing”’. The development 
of lipid biomarker and stable isotope proxies that allow identification of 
lipid sources from absorbed organic residues in pottery vessels, offers a 
direct means of testing ideas related to specialized uses of vessels, the 
antiquity of dairying and related processing activities'*. 

Here we investigate the function of Linear Pottery sieves using chro- 
matographic (gas-chromatography), spectrometric (gas chromatography- 
mass spectrometry, GC-MS) and isotopic (GC-combustion-isotope 
ratio MS, GC-C-IRMS) analyses of organic residues in archaeo- 
logical potsherds from settlements in the region of Kuyavia, Poland. 
Furthermore, we compare the contents of these vessels with other types 
of pottery to assess whether specialization existed in the use of vessels. 
Lipid residue analyses focused on 50 fragments of sieves from 34 
vessels recovered from Linear Pottery settlements along the lower 
Vistula river (including Brześć Kujawski 3 and 4, Miechowice 4, Smolsk 
4, Wolica Nowa 1, Stare Nakonowo 2, and Ludwinowo 6 and 7; Sup- 
plementary Fig. 2) and, for comparison, coarse ware potsherds from the 


Figure 2 | Partial gas chromatograms of total lipid extracts from typical
pottery from Ludwinowo 7 (Poland) (y axis, relative intensity). a, Sieve and
bowl: degraded animal fat with a wide distribution of triacylglycerols (sieve
LDW0769). b, Cooking pot: degraded animal fat with a narrow distribution of
triacylglycerols (LDW1020). c, Collared flask: beeswax (LDW1070). Associated
shapes: d, sieve LDW0769; e, bowl LDW1059; f, cooking pot LDW1039;
g, collared flask LDW1074. AL, alkanes; DAG, diacylglycerols; FFA n, fatty
acids with n carbon atoms and no double bonds; br, branched; HW, wax
hydroxymonoesters; K, mid-chain ketones with 31, 33 and 35 carbon atoms; IS,
internal standard (n-tetratriacontane); MAG, monoacylglycerols; OL, alcohols;
TAG, triacylglycerols; with M, acyl carbon number; and W, wax monoesters.
three major vessel types from Ludwinowo 7 (‘cooking’ pots or Kümpfe,
n = 44; bowls, n = 7; and collared flasks, n = 15; Supplementary Tables 1
and 2) dating from the early (5,400/5,300-5,200 calibrated calendar years
BC (cal. BC)), classic (5,200-5,000 cal. BC) and late (5,100/5,000-4,900/4,800
cal. BC) phases of the Linear Pottery in Kuyavia (Table 1 and Fig. 2d-g).

Lipids were extracted using well-established protocols. A high
proportion of the sieve sherds, around 40%, contained lipid residues,
with concentrations up to 875 µg g⁻¹ (mean 166 µg g⁻¹ for the sherds
containing lipids), consistent with observations from food processing
vessels from other European archaeological sites. Likewise, lipid
residues were extracted from around 59%, 29% and 33% of the cooking
pots, bowls and collared flasks, respectively, with concentrations up to
2 mg g⁻¹ (means for sherds with residues: 225, 13 and 15 µg g⁻¹,
respectively); the high concentrations of lipids in cooking pots suggest
their extensive use in cooking fat-rich animal foodstuffs.

Most total lipid extracts were dominated by C16:0 and C18:0 fatty
acids, accompanied by odd-number and branched-chain fatty acids
(C17:0 and C17:0br), which are biomarkers of bacterial populations found
in the rumen of milk-producing species as well as ruminant animal
fats (Fig. 2a, b). Triacylglycerols (TAGs) and their degradation products
(diacylglycerols (DAGs) and monoacylglycerols (MAGs)) were
observed in around 90% of the animal fats, indicating the high level
of preservation of residues. As fresh dairy and adipose fats are
characterized by wide and narrow distributions of TAGs, respectively, the


contrasting distributions in sieves (C4. to C54) and cooking pots (C4 to
C54) suggest specialization in the use of the two vessel types (Fig. 3a, b).
This was confirmed through the determination of the δ¹³C values of
the individual n-alkanoic acids, palmitic (C16:0) and stearic (C18:0)
acids. The use of the Δ¹³C proxy
($\Delta^{13}\mathrm{C} = \delta^{13}\mathrm{C}_{18:0} - \delta^{13}\mathrm{C}_{16:0}$) removes
the exogenous factors linked to the environment, thereby highlighting
the metabolic and biosynthetic characteristics of the animal fat source
and allowing the distinction between non-ruminant and ruminant
fats, and adipose from dairy fats, to be drawn.
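
As an illustration of why the difference proxy is insensitive to a shared environmental shift, the following minimal sketch computes Δ¹³C from a pair of δ¹³C values. The function name and the numerical values are hypothetical and are not data from this study; the sketch only shows that an offset applied equally to both fatty acids cancels in the difference.

```python
def big_delta_13c(delta_c18_0, delta_c16_0):
    """Return the proxy: δ13C of C18:0 minus δ13C of C16:0, in per mil."""
    return delta_c18_0 - delta_c16_0


# Hypothetical δ13C values (per mil) for one residue; not measured data.
d16, d18 = -27.0, -32.0
proxy = big_delta_13c(d18, d16)

# A uniform offset applied to both fatty acids (an assumed stand-in for an
# environmental effect) leaves the difference unchanged.
offset = 2.0
shifted = big_delta_13c(d18 + offset, d16 + offset)

print(proxy, shifted)
```

Assigning a residue to a fat type would additionally require the reference ranges shown in Fig. 3, which are not reproduced here.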

Comparison of the Δ¹³C values obtained for the 16 animal fat residues
preserved in sieve fragments and corresponding to 12 individual
sieves with modern reference fats showed that all the residues but
one exhibited Δ¹³C values expected for dairy fats (Fig. 3c, e). Likewise,
the residues in bowls (n = 2) displayed the same range of values
characteristic of dairy fats. Significantly, all the extracts from cooking pots
(n = 24) show a very different signature, plotting within, or on the edge
of, the isotopic range expected for ruminant adipose fats (Fig. 3d, f).
The presence of odd-carbon number ketones (C31 to C35) in two
cooking pots implies that they were heated to high temperatures
(Fig. 2b). All the biomarker and isotopic evidence indicates markedly
different uses of the cooking and strainer vessels.

The presence of beeswax in 80% of the collared flasks with residues 
was confirmed by GC-MS through the identification of odd-carbon 
numbered alkanes (C2, to C33), even-carbon numbered alcohols (C4 
Figure 3 | Histograms of triacylglycerol distributions and fatty acid carbon
isotope compositions of lipids extracted from Kuyavia pottery. TAG
distributions of total lipid extracts of sieves from the region of Kuyavia (including
Ludwinowo 7) (a), and cooking pots from Ludwinowo 7 (b). The blue bars denote
TAGs present in both adipose and milk fats whereas those in red are only detectable
in milk fat. c, d, Plots of the δ¹³C values for the C16:0 and C18:0 fatty acids prepared
from animal fat residues extracted from sieves and cooking pots (see
Supplementary Tables 1 and 2). Each data point represents an individual vessel (the
orange and yellow data points represent duplicates, of two and four fragments
respectively, of one sieve vessel each). The analytical error (±0.3‰) is
approximately the size of the points on the graph. e, f, Δ¹³C values
($\Delta^{13}\mathrm{C} = \delta^{13}\mathrm{C}_{18:0} - \delta^{13}\mathrm{C}_{16:0}$) of the extracts
plotted against their $\delta^{13}\mathrm{C}_{16:0}$ values from the
same potsherds. Ranges show the mean ± 1 s.d. of the Δ¹³C values for a global
database comprising modern reference animal fats from the United Kingdom (animals
raised on a pure C3 diet), Africa, Kazakhstan, Switzerland and the Near East.
Significantly, the residues extracted from sieves (c and e) mostly contain dairy fats,
whereas cooking pots (d and f) contain ruminant adipose fats. The difference in the
Δ¹³C means is ~2.5‰, which is highly significant (t-test; P < 0.0005). The figure
clearly demonstrates the presence of dairy residues in sieves from the region of
Kuyavia and specialization in pottery use at the Linear Pottery site of Ludwinowo 7.
to C34), even-carbon numbered wax monoesters and hydroxymonoesters
(Cy to C59), and in some cases diesters (Fig. 2c). Beeswax was
also detected in one cooking pot.

These data provide evidence for the use of the Linear Pottery sieves
to process milk, between around 5,200 and 4,900-4,800 cal. BC. The
specific features of the potsherds, characterized by the presence of
randomly distributed holes, and the presence of high concentrations of
dairy residues in the sieves support arguments in favour of the use
of Linear Pottery sieves for cheese making, as butter making would not
require such technology. Beeswax was present in three sieves, probably
for waterproofing, helping to turn the cheese out of the mould or
straining honey from the comb. Interestingly, the bowl-form vessels
contained dairy fats and could have been used in combination with the
sieves to collect filtered whey.

Hence, we demonstrate the first evidence for specialized use of
vessels during the early Neolithic between approximately 5,400/5,300
and 4,900/4,800 cal. BC. No obvious evolution in use exists
through the period studied, with vessels of different ages containing 
analogous lipids. In contrast to the sieves and the bowls, the cooking 
pots were largely used for processing ruminant carcass products, pro- 
bably cattle or sheep and goats. Animal bone assemblages from Linear 
Pottery settlements in Kuyavia are characterized by high proportions 
of the bones of domestic cattle (from 68% to 80% of the overall number 
of identified specimens), relatively few sheep and goat bones (from 
13% to 18% of the overall number of identified specimens), and very 
few pig or wild herbivore bones, indicating a high reliance on domestic 
ruminants, especially cattle (M. Osypinska, unpublished observations).
The high abundance of cattle at milk-yielding sites points to
the importance of this ruminant in the intensification of milk use. The
presence of beeswax in most of the collared flasks suggests either honey
storage or, more probably, the use of beeswax as a waterproofing
agent, allowing their use in the storage or transport of water-based
commodities. One collared flask appears to have been waterproofed
with ruminant adipose fat rather than beeswax. The extensive recovery
of beeswax in pottery from Ludwinowo 7 underlines the importance of
bee products during the early Neolithic in Europe.

The evidence for the specialized use of Linear Pottery sieves in 
association with milk is important for three main reasons: (1) the 
typology of the sieves and the presence of dairy fats are consistent with 
milk processing, providing the earliest evidence of cheese making, 
which is notable because the manufacture of cheese increases the ease 
of handling of milk and allows the nutritional properties of milk to be 
readily available through the year; (2) the processing of milk to manu- 
facture low-lactose-content cheese is consistent with the predicted low 
level of lactase persistence in northern Europe in the early Neolithic;
and (3) the evidence of milk use by the people of the Linear Pottery
culture is consistent with the predicted increase in frequency of the
−13,910*T allele associated with lactase persistence among prehistoric
northern Europeans in this region.


METHODS SUMMARY 


Fifty fragments of sieves, representing 34 vessels, were sampled from the Kuyavia 
region (Poland), of which 37 fragments representing 21 sieves were excavated in 
Ludwinowo 7 (Supplementary Table 1). A further 66 potsherds from Ludwinowo 
7 were analysed, of which 44 were cooking pots, the rest originating from collared 
flasks (n = 15) and bowls (n = 7, Supplementary Table 2). Lipid residue analyses 
and interpretations were based on established protocols. Briefly, ~1 to 3 g
potsherds were sampled and their surfaces cleaned with a modelling drill to
remove any exogenous lipids. The sherds were then ground to a powder, an
internal standard was added and the powder was solvent-extracted by
ultrasonication (chloroform/methanol, 2:1 v/v, 2 × 10 ml). Solvent was
evaporated under a gentle stream of
nitrogen to obtain the total lipid extract. Aliquots of the total lipid extract were
trimethylsilylated (N,O-bis(trimethylsilyl)trifluoroacetamide, 40 µl, 70 °C, 1 h)
and submitted to analysis by gas chromatography and GC-MS. Further aliquots
of the total lipid extract were treated with NaOH/H2O (9:1 w/v) in methanol (5%
v/v, 70 °C, 1 h). Following neutralization, lipids were extracted into chloroform
(3 × 3 ml) and excess solvent evaporated under a gentle stream of nitrogen. Fatty
acid methyl esters were prepared by reaction with BF3-methanol (14% w/v, 70 °C,
1 h). Fatty acid methyl esters were extracted with chloroform (3 × 2 ml) and the
solvent removed with a gentle stream of nitrogen. Fatty acid methyl esters were then
re-dissolved into hexane for analysis by gas chromatography and GC-C-IRMS.
Current genomic perspectives on animal diversity neglect two
prominent phyla, the molluscs and annelids, that together account
for nearly one-third of known marine species and are important
both ecologically and as experimental systems in classical
embryology. Here we describe the draft genomes of the owl limpet
(Lottia gigantea), a marine polychaete (Capitella teleta) and a 
freshwater leech (Helobdella robusta), and compare them with 
other animal genomes to investigate the origin and diversifica- 
tion of bilaterians from a genomic perspective. We find that the 
genome organization, gene structure and functional content of 
these species are more similar to those of some invertebrate deu- 
terostome genomes (for example, amphioxus and sea urchin) than 
those of other protostomes that have been sequenced to date (flies, 
nematodes and flatworms). The conservation of these genomic 
features enables us to expand the inventory of genes present in 
the last common bilaterian ancestor, establish the tripartite diver- 
sification of bilaterians using multiple genomic characteristics and 
identify ancient conserved long- and short-range genetic linkages 
across metazoans. Superimposed on this broadly conserved pan- 
bilaterian background we find examples of lineage-specific genome 
evolution, including varying rates of rearrangement, intron gain 
and loss, expansions and contractions of gene families, and the 
evolution of clade-specific genes that produce the unique content 
of each genome. 

Molluscs, annelids and numerous smaller phyla typically share stereo- 
typed spiral cleavage patterns, cell-fate assignments and characteristic 
ciliated trochophore larvae, features that originated in the Precambrian 
era. These spiralian phyla are included in the larger lophotrochozoan
clade that is a sister group to the ecdysozoans (arthropods, nematodes
and other related phyla) but whose internal branching remains
controversial. However, so far the only deeply sequenced lophotrochozoan
genomes are those of platyhelminth flatworms (two parasitic
schistosomes and a free-living planarian), whose comparatively rapid
rates of genome evolution do not reflect a general condition of lopho- 
trochozoans (see below). In this study, we explore spiralian diversity at 
the genomic level by comparative analysis of one mollusc and two 
annelid genomes (Supplementary Note 1). 

We assembled the limpet, polychaete and leech genomes from appro- 
ximately eight-fold random whole-genome shotgun coverage with Sanger 
dideoxy sequencing reads (Supplementary Note 2). No genetic or physical 
maps were available for these systems, so we reconstructed each genome 
as scaffolds (gap-containing sequences). The three genomes reported here 
each encode an estimated 23,000 to 33,000 protein-coding genes (Table 1,
Supplementary Table 2.2.2 and Supplementary Note 2.2; the repetitive
landscape of these genomes is discussed in Supplementary Note 3.2).
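
The scaffold N50 values reported in Table 1 follow the definition given in the table footnote: the length such that half of the assembled sequence is in scaffolds at least that long. A minimal sketch of that calculation is shown below; the function name and the scaffold lengths are hypothetical and are not taken from the assemblies described here.

```python
def scaffold_n50(lengths):
    """Return the length L such that scaffolds of length >= L together
    contain at least half of the total assembled sequence."""
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold downwards until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise ValueError("no scaffold lengths supplied")


# Hypothetical scaffold lengths (arbitrary units), for illustration only.
example_lengths = [50, 40, 30, 20, 10]
print(scaffold_n50(example_lengths))
```

With these illustrative lengths the two longest scaffolds (50 + 40) cover more than half of the total of 150, so the function returns the length of the second one.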

Comparing the new genomes with other metazoan sequences, we 
characterized 8,756 modern bilaterian gene families as likely to have 
arisen from single progenitor genes in the last common bilaterian 
ancestor (Supplementary Note 3.4). As gene loss is common and highly 
diverged orthologues can be difficult to detect, this is a conservative 
lower bound on the number of genes encoded by the last common 
bilaterian ancestor. Of the 8,756 gene families, 763 were newly identified 
as being of bilaterian ancestry based on the new spiralian genomes 
(Supplementary Note 3.4). These newly identified bilaterian families 
belong to various functional categories (Supplementary Table 3.4.1), 
the most prominent being members of the G-protein-coupled recep- 
tor superfamily and epithelial sodium channels (see below) as well as 
various metabolic enzymes. Through subsequent gene duplication, the 
8,756 ancestral bilaterian families conservatively account for 47 to 85% 
of genes in other bilaterian species (70% of human genes; Supplemen- 
tary Note 3.4). Most of the remaining genes in extant bilaterian gen- 
omes share at least one domain with the bilaterian gene families, or have 
a significant BLAST (Basic Local Alignment Search Tool) hit when 
compared against sequences from bilaterian gene families, suggesting 
that they have arisen through descent with modification (Supplemen- 
tary Note 3.5). 

Exon-intron structures are highly conserved between spiralians and 
other animals; thus we infer that cis-splicing of intron-rich genes was the 
ancestral state of metazoans, bilaterians and protostomes (Supplemen- 
tary Note 5.2). In most cases, exon boundaries in the newly sequenced 
spiralians are precisely conserved between orthologous genes in 
sequenced deuterostomes (vertebrates, sea urchin and amphioxus) 
and non-bilaterians (Trichoplax and starlet sea anemone). For example, 
75% of human introns are present in one or more of the spiralians, 
whereas only 14% of the same introns are found in Drosophila.
However, intron gain or loss rates vary markedly among the three 
spiralians. In particular, H. robusta also has substantially more novel 
introns than do the other two sequenced spiralians (Supplementary 
Notes 5.2 and 5.3, and Supplementary Fig. 5.2.1), the first of several 
indicators of a notably dynamic genome in this lineage. 

Collectively and individually, the spiralian genomes reported here 
retain most of the inferred ancestral bilaterian gene families (8,203 
out of 8,756, corresponding to a 94% retention rate, compared to 
7,553 or 86% retention rate in human). In contrast, the collective 
retention rate of only 65% for sequenced flatworms (53% for schisto- 
somes and 60% for Schmidtea) reflects the absence (and presumed 
                                                            Lottia      Capitella   Helobdella
                                                            gigantea    teleta      robusta
Size of genome assembly (Mbp)                               348         324         228
Scaffold N50 (Mbp)                                          1.87        0.19        3.06
Repetitive content (%)                                      21          31          33
GC (%)                                                      33          40          33
Predicted number of genes                                   23,800      32,389      23,400
Number of genes in orthologous clusters with other species  16,183      20,537      13,820
Number of genes in ancestral bilaterian gene families       10,681      11,911      8,707
Mean number of exons per gene (with ≥2 orthologues)         8           7           8
Mean exon length (bp)                                       213         221         203
Mean intron length (bp)                                     787         291         526


GC, fraction of guanine plus cytosine nucleobases; Scaffold N50, the length such that half of the assembled sequence is in scaffolds longer than this length; Mbp, megabase pairs. 


loss) of more than 3,018 ancestral bilaterian gene families in these 
flatworms. Similar losses are observed for introns (Supplementary 
Note 5.3), as well as synteny (see below), which indicate a higher rate 
of genomic turnover in platyhelminths than in the mollusc and annelid 
genomes reported here. 
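
The retention rates quoted above can be checked with simple arithmetic. The sketch below recomputes them from the family counts given in the text; the function and variable names are illustrative, and the flatworm figure is only an upper bound because the text states that more than 3,018 families are absent.

```python
ANCESTRAL_FAMILIES = 8756  # inferred ancestral bilaterian gene families (from the text)


def retention_percent(retained, total=ANCESTRAL_FAMILIES):
    """Percentage of ancestral gene families still detected."""
    return 100.0 * retained / total


spiralians = retention_percent(8203)  # collective count for the three spiralians
human = retention_percent(7553)

# "More than 3,018" families are absent in flatworms, so this is an upper bound.
flatworm_upper = retention_percent(ANCESTRAL_FAMILIES - 3018)

print(round(spiralians), round(human), round(flatworm_upper, 1))
```

The rounded spiralian and human values agree with the 94% and 86% quoted in the text, and the flatworm bound is close to the reported collective rate of 65%.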

Against this background of conserved gene content and structure, 
we find several significantly (P < 0.05) expanded gene families in spe- 
cific spiralian clades (Supplementary Note 4.2). The sensory transduc- 
tion and signalling genes of the G-protein-coupled receptor (GPCR) 
superfamily in C. teleta are a prime example. All six of the rhodopsin- 
like GPCRs represented in the KEGG (Kyoto Encyclopedia of Genes 
and Genomes) neuroactive ligand-receptor interaction pathway are 
expanded in C. teleta (but not in H. robusta or L. gigantea), as are 
several other GPCRs (Supplementary Figs 4.3.2 and 4.4.1). Moreover, 
the C. teleta genome encodes 372 putative GPCR receptors that are 
most similar to peptide-binding GPCR subfamilies according to the 
family classification in the GPCR database (http://www.gpcr.org/7tm/ 
proteinfamily/). This number is considerably higher than that obtained 
for H. robusta (58), L. gigantea (113), Drosophila (32) or human (120) 
using the same methods (Supplementary Note 4.4). Most of these 
expansions occur as tandem duplicates. The C. teleta genome also 
shows an expansion of the calcium-signalling pathway downstream 
of GPCR (Supplementary Note 4.3). It is tempting to speculate that 
these expansions are related to the function of polychaete chemosen- 
sory structures such as antennae, palps and cirri (head sensory organs), 
and the nuchal organ. Another notable feature of all three genomes is
the presence of several atypical GPCRs with weak similarity to both 
vertebrate rhodopsin-like GPCRs and chemosensory receptors describ- 
ed previously in nematodes (Supplementary Fig. 4.4.1). Further studies 
are needed to determine whether these receptors can be classified as 
divergent members of previously described GPCR classes or whether 
they constitute novel groupings as described recently in planarians.

We also find changes in gene content associated with sensory pro- 
cessing in the leech. These changes include expansion of the epithelial 
sodium channel (ENaC) receptor gene family that functions in the 
taste-transduction pathway (Supplementary Fig. 4.3.5), and the gap- 
junction-forming innexin gene family, as well as gene families involved 


in development (for example, homeobox genes (see below, and Sup- 
plementary Notes 4.5 and 4.6)). Both mollusc and annelid genomes are 
also enriched in specific metabolic enzymes and pathways of unknown 
relevance (for example, galactoside 2-α-L-fucosyltransferase; see
Supplementary Notes 4.2 and 4.3). In general, lineage-specific gene family
expansions seem to be the norm in the evolutionary diversification of
modern taxa from the bilaterian ancestor, whereas the more ancient
unicellular-metazoan and metazoan-bilaterian transitions are more
notably marked by the acquisition of apparently novel (or highly
divergent) gene families (Supplementary Table 3.5.1).

We identified 231 putative spiralian-specific gene families whose 
members are readily aligned across all three spiralians (indicating 
purifying selection), but which lack obvious orthologues by BLAST 
in non-spiralian genomes (Supplementary Note 3.6). However, nearly 
two-thirds of these (188 out of 231; 62%) showed residual similarity to 
non-spiralian genes using more sensitive Hidden Markov Model 
methods, which suggests that they belong to ancient bilaterian gene 
families (Supplementary Note 3.6) that diverged extensively on the 
stem lineage leading from the bilaterian ancestor to the mollusc- 
annelid ancestor (‘type II’ novelties). The remaining 43 out of 231
novel gene families are without any significant (E values of less than
0.01) similarities outside of spiralians (‘type I’ novelties). More than
one-half of the 231 spiralian novelties are transcribed based on existing 
expressed sequence tag (EST) evidence, with enriched expression in 
adult rather than embryonic tissues (Supplementary Notes 2.4 and 
3.7), hinting at roles in clade-specific adaptations beyond the early 
conserved stages of development. 

The inference of deep phylogenetic relationships among animal 
phyla is controversial but has benefitted from the use of multiple 
orthologous genes as phylogenetic markers. Recent EST-based
studies provide broad taxonomic representation but rely on a limited
number of available genes or are forced to accommodate a substantial
amount of missing data. In contrast, full genome sequences provide
nearly complete sets of orthologues exempt from sampling bias, but 
can be more sensitive to long-branch attraction artefacts. 

To strike a balance between the number of phylogenetically infor- 
mative characters and possible long-branch artefacts we ranked 1,180 
Figure 1 | Full-genome evidence resolves metazoan relationships and
verifies the monophyly of lophotrochozoans and spiralians. a, A protein tree
inferred from 299,129 amino acid positions gathered from 827 slow-evolving
orthologues using RAxML and modelling heterogeneity of substitution
processes using a LG + Γ4 model with each gene partitioned. Strong support is
obtained for the monophyly of lophotrochozoans. b, Intron tree obtained from
a matrix of 5,377 introns analysed using MrBayes and an asymmetric binary
model (probability of gain: 0.01). c, Indel tree reconstructed from a matrix of
1,928 indel sites using a regular binary model. Circles at nodes indicate a
bootstrap support of >0.90 (a) or a posterior probability of >0.95 (b and c). In
b and c, arrows indicate species that do not follow the protein family tree
clusters of orthologous genes (from 22 complete genomes) by their
evolutionary rates (Supplementary Fig. 5.1.1) and identified a set of
827 slowly evolving genes that include 299,129 aligned amino acid
positions suitable for deep phylogenetic analysis. These characters
strongly support the tripartite view of bilaterians and the monophyly
of available lophotrochozoans (annelids, molluscs and platyhelminths)
(Fig. 1a); the progressive addition of characters representing more
rapidly evolving genes monotonically erodes support for this view, as
expected under long-branch attraction (Supplementary Fig. 5.1.2).
Although taxon sampling is generally considered critical to resolving
deep phylogeny, our analyses show the importance of gene sampling.
The rate-stratification approach introduced here could be used to place
problematic taxa (for example, acoels, ctenophores and chaetognaths)
when appropriate genome data become available.
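
The logic of rate stratification can be sketched as a ranking step followed by the construction of nested gene sets in which faster-evolving genes are added last. The code below is only an illustration under stated assumptions: the cluster identifiers, the rate values and the function names are hypothetical, and how the evolutionary rates themselves were estimated (Supplementary Fig. 5.1.1) is not reproduced.

```python
def rank_by_rate(rates):
    """Return cluster identifiers ordered from slowest to fastest evolving."""
    return sorted(rates, key=rates.get)


def nested_character_sets(rates, step):
    """Build progressively larger gene sets, adding faster genes last."""
    ranked = rank_by_rate(rates)
    return [ranked[:end] for end in range(step, len(ranked) + 1, step)]


# Hypothetical per-cluster rates (arbitrary units), for illustration only.
rates = {"og1": 0.30, "og2": 0.05, "og3": 0.12, "og4": 0.80}
sets = nested_character_sets(rates, step=2)
print(sets)
```

Each nested set could then be analysed separately to see how support for a given topology changes as faster-evolving genes are included.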

We also examined the phylogenetic signals in the gain and loss of 
introns, and insertions or deletions (indels) within coding sequences, 
incorporating spiralian sequences for the first time. Although few 
evolutionary reconstructions have been attempted with these charac- 
ters, they have attractive properties for phylogenetic analysis as change 
is rare and generally irreversible (Supplementary Note 5.3).
Phylogenetic reconstruction using binary matrices encoding intron and
indel presence or absence recovered the backbone of metazoan phylo- 
geny, and intron data provided strong support for grouping molluscs, 
annelids and platyhelminths (Fig. 1b). However, the analysis based on 
indels showed specific discrepancies relative to other data sets, notably 
the grouping of nematodes and platyhelminths (Fig. 1c). As this 
grouping is not consistent with either amino acid or intron analyses, 
we ascribe it to the accelerated genome evolution in these taxa and the 
low number of phylogenetically informative indel characters. 
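
A binary presence or absence matrix of the kind described above can be assembled as follows. This is a minimal sketch: the species labels and intron identifiers are hypothetical, and the subsequent tree inference is not shown.

```python
def presence_absence_matrix(observations):
    """Encode which characters (for example, introns or indels) each species has.

    observations maps a species name to the set of character identifiers
    observed in that species. Returns the ordered character list and, for
    each species, a row of 1 (present) or 0 (absent).
    """
    characters = sorted(set().union(*observations.values()))
    rows = {
        species: [1 if character in present else 0 for character in characters]
        for species, present in observations.items()
    }
    return characters, rows


# Hypothetical observations, for illustration only.
observations = {
    "species_A": {"intron_1", "intron_2"},
    "species_B": {"intron_2"},
    "species_C": {"intron_2", "intron_3"},
}
characters, rows = presence_absence_matrix(observations)
print(characters)
print(rows)
```

In this toy example intron_2 is present in every species and therefore carries no grouping information, which illustrates why the number of informative characters can be much smaller than the size of the matrix.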

All three trees possess short internal branches near the base of 
bilateria (see ref. 22), which indicates that the diversification into 
separate lophotrochozoan, deuterostome and ecdysozoan lineages 
was relatively fast, taking perhaps 30 to 80 million years (Myr) (com- 
parable to the diversification of mammals; Supplementary Note 5.5). 

We also sought evidence for genome-wide functional diversification
across metazoan genomes using principal component analysis
(Supplementary Note 4.1). Remarkably, this phenetic approach grouped
the newly sequenced mollusc and annelids with invertebrate
deuterostomes (amphioxus, sea urchin and sea squirt) and non-bilaterian
metazoan phyla (cnidarian, placozoan and demosponge) (Fig. 2).
Given that this grouping includes both bilaterians and non-bilaterian
metazoans, cladistic logic implies that these genomes approximate the
ancestral bilaterian (and metazoan) genomic repertoire. In contrast,
vertebrate genomes form a distinct cluster, and are thus functionally
derived relative to this ancestral bilaterian state, partly owing to the
diversification of genes related to the vertebrate innate and adaptive
immune system that dominate the loadings of principal component 1
(PC1, Fig. 2) (Supplementary Table). The functional coherence of the
genes that differentiate currently available ecdysozoan genomes 
through PC2 is unclear. Although this analysis may be skewed by 
the more complete functional annotation of vertebrates and classical 
model systems, other similar analyses less dependent on function con- 
firm the clear separation of vertebrates from other metazoan genomes 
(Supplementary Note 4.1). 

The L. gigantea and C. teleta genomes show extensively conserved 
macrosynteny with each other, with chordates (including human; see 
Fig. 3 and Supplementary Note 6) and with several other extant metazoan
lineages (sea anemone, placozoan and demosponge). In our
analyses, conserved macrosynteny requires only conserved linkage
between orthologous genes, and is independent of intra-chromosome
rearrangements (that is, scrambling of gene order) that are typical in
phylogenetically deep comparisons. Conserved macrosynteny in
L. gigantea and C. teleta involves nearly one-half of the conserved
protein coding genes in these species (Supplementary Note 6 and
Supplementary Table 6.3.1). In contrast, we found no significant
conservation of macrosynteny between H. robusta and other species, implying
extensive reorganization in the leech genome relative to the last
common spiralian ancestor.
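
Because macrosynteny as defined here depends only on which orthologues share a scaffold or chromosome segment, and not on their order, it can be summarized by counting orthologue pairs for each pair of units. The sketch below illustrates that counting step only; the gene names, scaffold labels and segment labels are hypothetical, and the significance test applied in the study (Supplementary Note 6) is not reproduced.

```python
from collections import Counter


def linkage_counts(orthologue_pairs, unit_a, unit_b):
    """Count orthologue pairs for each (unit in genome A, unit in genome B).

    Gene order within a unit is ignored, so intra-chromosomal scrambling
    does not change the counts.
    """
    counts = Counter()
    for gene_a, gene_b in orthologue_pairs:
        counts[(unit_a[gene_a], unit_b[gene_b])] += 1
    return counts


# Hypothetical orthologue pairs and gene locations, for illustration only.
pairs = [("a1", "h1"), ("a2", "h2"), ("a3", "h3"), ("a4", "h4")]
scaffold_of = {"a1": "scaf_1", "a2": "scaf_1", "a3": "scaf_1", "a4": "scaf_2"}
segment_of = {"h1": "seg_X", "h2": "seg_X", "h3": "seg_X", "h4": "seg_Y"}

counts = linkage_counts(pairs, scaffold_of, segment_of)
print(counts)
```

A pair of units that shares many more orthologues than expected by chance would be a candidate for conserved linkage.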

The observed conserved macrosynteny demonstrates the persistence
of 17 ancient bilaterian ancestral linkage groups (ALGs) in the
common ancestor of L. gigantea and C. teleta. Independent fusions
(two in L. gigantea, three in C. teleta) subsequently reduced the number
of bilaterian ALGs that remain distinct in these genomes. The
conservation of 17 bilaterian ALGs among L. gigantea, C. teleta and
various deuterostomes implies that the last common protostome and
deuterostome ancestors also had this organization. Some ecdysozoans
like Caenorhabditis elegans (soil nematode), Tribolium castaneum
(beetle) and Bombyx mori (moth) (Supplementary Note 6) also show
clear evidence of conserved macrosynteny. However, the large number
of chromosome fusions and rearrangement events makes similar
reconstruction of the ancestral ecdysozoan ALGs impossible with current
data (Supplementary Note 6).

Remarkably, we can also use the L. gigantea and C. teleta genomes to 
infer ancient translocations between linkage groups (Supplementary 
Note 6). For example, L. gigantea and C. teleta share a translocation 
relative to the last common bilaterian ancestor (Fig. 3), indicating that 
this genomic rearrangement occurred on the stem lineage leading 
from the bilaterian to the mollusc-annelid node. As noted above, more
recent translocations that are not shared between L. gigantea and 
C. teleta are also evident (Supplementary Note 6). It remains unclear 
whether these genome reorganizations were causally involved in the 
radiation of diverse bilaterian lineages, or were simply neutral changes. 
Figure 2 | Clustering of metazoan genomes in a multidimensional space of
molecular functions. The first two principal components are displayed,
accounting for 20% and 15% of variation, respectively. At least three clusters are
evident, including a vertebrate cluster (far right), a non-bilaterian metazoan,
invertebrate deuterostome or spiralian cluster (centre, top), and an ecdysozoan
group (lower left). Drosophila and Tribolium (lower left) are outliers. Aqu,
Amphimedon queenslandica (demosponge); Bfl, Branchiostoma floridae
(amphioxus); Cel, Caenorhabditis elegans; Cte, Capitella teleta (polychaete);
Cin, Ciona intestinalis (sea squirt); Dme, Drosophila melanogaster; Dpu,
Daphnia pulex (water flea); Dre, Danio rerio (zebrafish); Isc, Ixodes scapularis
(tick); Gga, Gallus gallus (chicken); Hsa, Homo sapiens (human); Hma, Hydra
magnipapillata; Hro, Helobdella robusta (leech); Lgi, Lottia gigantea (limpet);
Mmu, Mus musculus (mouse); Nve, Nematostella vectensis (sea anemone); Sma,
Schistosoma mansoni; Sme, Schmidtea mediterranea (planarian); Spu,
Strongylocentrotus purpuratus (sea urchin); Tad, Trichoplax adhaerens
(placozoan); Tca, Tribolium castaneum (flour beetle); Xtr, Xenopus tropicalis
(clawed frog).
Figure 3 | Macrosynteny between spiralians, humans and Trichoplax.

a, The location of genes in scaffolds of L. gigantea, C. teleta and Trichoplax (a 
non-bilaterian outgroup that conserves synteny) relative to the position of their 
orthologues in the human genome. The human chromosome segments have 
been grouped according to their ancestral linkage group (ALG); chromosome 
segment identifiers are also shown (see ref. 10). Human genes in ALG 2 have 
their orthologues co-located on a limited set of scaffolds in L. gigantea, C. teleta 
and Trichoplax, indicating conserved linkage of this group of genes across 
eumetazoan lineages. In contrast, although ALG 17 and ALG 9 are preserved 
separately in Trichoplax, scaffolds of L. gigantea and C. teleta have homologous 
gene content either with ALG 9 or with both ALG 9 and ALG 17, indicating a 


The most famous example of conserved microsynteny (conserved
tight linkages between orthologous genes) is the Hox complex, an
ancient cluster of homeodomain-containing transcription factors with
conserved roles in patterning the anteroposterior body axis of animals.
In L. gigantea, 11 Hox genes occur as a single cluster that is structurally
collinear with intact Hox clusters found in other genomes, and is the
first intact cluster found in a lophotrochozoan (Fig. 4). C. teleta Hox
genes occur in one-to-one correspondence with their L. gigantea
counterparts but lie on three scaffolds, with the scaffold harbouring the
posterior class gene post1 clearly disconnected from the main cluster.
We therefore infer that the last common mollusc-annelid ancestor had
a single 11-gene Hox cluster (Supplementary Note 8) with 3 anterior-
and 6 central-class genes, plus 2 posterior-class genes (post1 and post2)
that arose by duplication along the spiralian (or lophotrochozoan) stem
lineage. In contrast, the Hox complex of H. robusta has fragmented
extensively, consistent with the general loss of synteny conservation in
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translocation of one or more chromosome segments from ALG 17 to ALG 9 in 
the common ancestor of molluscs and annelids, after the divergence of the 
spiralian and vertebrate lineages. Genes inferred to derive from this 
translocated segment are shown in red. Subsequent intra-chromosomal 
rearrangement has dispersed the translocated genes among the genes of ALG 9. 
b, The scenario in panel a represented schematically on a phylogenetic tree, 
with chromosomes of ancestral and living genomes represented as horizontal 
blue bars and the translocated segment represented in red. c, The positions of 
human genes and their L. gigantea, C. teleta and Trichoplax adhaerens 
orthologues compared in dot plots schematically (and in the real data; see 
panel a) for three ALGs. 


H. robusta, and there have been multiple duplications and loss of two 
mollusc-annelid paralogy groups (the orthologues of the anterior-class 
proboscipedia and post1). Intriguingly, although the gene rearrange- 
ments observed in H. robusta are as extreme as in C. elegans, the leech 
is not particularly derived with respect to other genomic characters. 
This lineage may therefore be an interesting model for focused studies 
on rapid evolution of gene order. We also find other tightly linked groups 
of anciently duplicated (that is, paralogous) genes, including clusters 
of deeply diverged gene superfamilies such as the homeodomain”, 
forkhead box” and wingless”* gene families that duplicated extensively 
before the bilaterian radiation but have remained linked (Supplemen- 
tary Note 7.4). 

Overall, we found hundreds of other examples of conserved micro- 
syntenic blocks involving thousands of genes in L. gigantea, C. teleta 
and other metazoan genomes (Supplementary Note 7.1). We consider 
a microsyntenic block to be a group of three or more genes whose 
orthologues are tightly linked (that is, separated by no more than ten 
intervening genes) in two or more genomes. Microsyntenic blocks are 
often, but not always, embedded in a conserved macrosyntenic con- 
text. The count of 469 microsyntenic blocks that are putatively pre- 
served from the bilaterian ancestor in at least one protostome and one 
deuterostome genome is substantially greater than the 157 blocks that 
would persist by chance in a simple model of genome rearrangement in 
which gene order is randomized within the macrosynteny blocks 
defined above (Supplementary Notes 6 and 7), implying either func- 
tional constraint on genome organization or intrinsically slow rates of 
rearrangement in some genomic regions. Considering the deeply 
diverged bilaterian lineages represented by L. gigantea, C. teleta and 
amphioxus (Supplementary Table), we found 77 conserved microsyn- 
tenic blocks (Supplementary Note 7.1), which in some cases are stably 
conserved across other metazoan genomes (Fig. 5). It is tempting to 
speculate that these conserved linkages are due to selection for preserv- 
ing complex cis-regulatory landscapes (Supplementary Note 7.2). 

Figure 4 | The Hox gene complement and linkage in the three 
lophotrochozoan genomes and selected bilaterians. Arrows indicate 
direction of transcription (orientation between scaffolds is arbitrary). Scaffolds 
with ends marked by black dots may be part of a larger Hox complex because 
the Hox gene is at the end of the scaffold. B. floridae, Branchiostoma floridae. 
Colours indicate unambiguously assigned paralogy groups (Hox1, Hox2, Hox3, 
Hox4, central class and posterior class). 

Figure 5 | Examples of conserved orthologous gene clusters. a-c, Clusters of 
linked genes across diverse species. Within each panel, genes in the same colour 
are members of the same orthologous group, with the gene identifiers of 
defining members of the group indicated: C7orf42, (human) chromosome 7 
open reading frame 42; Exoc7, exocyst complex component 7; FAM100, family 
with sequence similarity 100; FoxJ, forkhead box protein J; HDC, histidine 
decarboxylase; MGRN E3 Ub ligase, mahogunin ring finger E3 ubiquitin ligase; 
MORC, MORC family CW-type zinc finger; PSMG1, proteasome (prosome, 
macropain) assembly chaperone 1; ROX/MINT, Max-binding protein family 
member; SYAP1, synapse-associated protein 1; WBP11, WW domain binding 
protein 11; ZMPSTE24, zinc metallopeptidase, STE24 homologue. Scaffold 
positions for all displayed linkages are listed in Supplementary Note 7.2. 
d, Cumulative rates of microsynteny change plotted on a fixed metazoan tree 
topology. Branch lengths are proportional to the number of inferred genomic 
rearrangements. A. queenslandica, Amphimedon queenslandica; C. intestinalis, 
Ciona intestinalis; D. melanogaster, Drosophila melanogaster; D. pulex, 
Daphnia pulex; D. rerio, Danio rerio; G. gallus, Gallus gallus; H. magnipapillata, 
Hydra magnipapillata; I. scapularis, Ixodes scapularis; M. musculus, Mus 
musculus; N. vectensis, Nematostella vectensis; Ub, ubiquitin; X. tropicalis, 
Xenopus tropicalis. 

Although molluscs and annelids are related to flies, nematodes and 
flatworms within the protostomes, we find that their genomes are in 
many ways more similar to those of invertebrate deuterostomes (such 
as amphioxus and sea urchin) as well as non-bilaterian metazoans 
(such as cnidarians, sponges and placozoans). These similarities reveal 
features of bilaterian and/or metazoan genomes that have been lost or 
diverged in many protostome genomes reported so far, and thus 
enable a more complete reconstruction of genomic features of the last 
common ancestors of protostomes, bilaterians and metazoans, includ- 
ing gene and chromosome structure and organization. Superimposed 
on these conserved features are evolutionary innovations (novel gene 
families and gene-family expansions and losses, as well as large- and 
small-scale genomic rearrangements) that make each clade unique. 
Nearly 20 other phylum-level taxa lack even a single genome sequence, 
and intra-phylum genomic variation can be extensive. Thus, for a 
comprehensive genomic understanding of the metazoan radiation a 
far larger sampling of genomes will be needed. 


METHODS SUMMARY 

Gene families and phylogeny. Orthology relationships were reconstructed for 
22 metazoan genomes (Supplementary Fig. 3.3.1) using a phylogenetic clustering 
approach, which progressively examined reciprocal best scoring BLAST hits at 
decreasing phylogenetic nodes of a reference animal tree. We recovered 1,235 gene 
families with orthologous members in all genomes. To assess the effect of fast and 
slow evolving characters on the tree topology, several phylogenetic approaches 
were taken (see Supplementary Note 5). 

Identification of 8,756 ancestral bilaterian genes. Gene families were considered 
to be ancestral bilaterian gene families when an orthologous group had at least two 
protostome and two deuterostome representatives (in-group) or two sequences from 
either in-group and two from basal (that is, non-bilaterian) metazoans (out-group). 
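
The criterion above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' pipeline: the function name and the clade labels are hypothetical, and "two sequences from either in-group" is read here as at least two protostome or at least two deuterostome members.

```python
from collections import Counter


def is_ancestral_bilaterian(member_clades):
    """Return True if an orthologous group meets the ancestral-bilaterian rule.

    member_clades: one label per group member, each being "protostome",
    "deuterostome" or "outgroup" (non-bilaterian metazoan). Labels are
    hypothetical names chosen for this sketch.
    """
    counts = Counter(member_clades)
    protostomes = counts["protostome"]
    deuterostomes = counts["deuterostome"]
    outgroup = counts["outgroup"]

    # Rule 1: at least two protostome and two deuterostome representatives.
    both_ingroups = protostomes >= 2 and deuterostomes >= 2
    # Rule 2 (assumed reading): at least two members from one in-group
    # plus at least two members from the non-bilaterian out-group.
    ingroup_and_outgroup = (protostomes >= 2 or deuterostomes >= 2) and outgroup >= 2
    return both_ingroups or ingroup_and_outgroup
```

Under this reading, a family with two protostome and two sponge or cnidarian members would count as ancestral, whereas one with a single protostome and a single deuterostome member would not.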
Macrosynteny. Draft genome scaffolds were clustered into ancestral linkage 
groups (ALGs) based on the locations of orthologous genes in other metazoan 
genomes, as described previously’? We iteratively constructed a parsimonious 
scenario of chromosome evolution, and ancestral genes were assigned to ancestral 
ALGs when any other assignment would imply more hops between ALGs in the 
history of that gene family (Supplementary Note 6). 

Microsynteny. Chromosomal locations of orthologous genes in two different 
species were compared. If another set of orthologous genes was identified within a 
maximal distance of 10 genes of the previous set, both sets were merged together 
into a microsyntenic block. Only syntenic blocks with at least three orthologues 
per species were considered. 
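
The merging step described above might look roughly like the following Python sketch. It is a simplified illustration, not the published implementation: the input format (one tuple per orthologue pair, giving scaffold and gene index in each species), the function name, and the interpretation of "maximal distance of 10 genes" as a difference in gene index of at most 10 are all assumptions made for this example.

```python
def find_microsyntenic_blocks(pairs, max_distance=10, min_genes=3):
    """Group orthologue pairs into candidate microsyntenic blocks.

    pairs: list of (scaffold_a, index_a, scaffold_b, index_b) tuples, where
    the indices are gene positions along a scaffold (hypothetical format).
    Two pairs are linked when they share scaffolds in both species and lie
    within max_distance genes of each other in both species; linked pairs
    are merged transitively (single linkage).
    """
    parent = list(range(len(pairs)))

    def find(i):
        # Union-find lookup with path compression.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, (sa1, ia1, sb1, ib1) in enumerate(pairs):
        for j in range(i + 1, len(pairs)):
            sa2, ia2, sb2, ib2 = pairs[j]
            close_in_a = sa1 == sa2 and abs(ia1 - ia2) <= max_distance
            close_in_b = sb1 == sb2 and abs(ib1 - ib2) <= max_distance
            if close_in_a and close_in_b:
                parent[find(i)] = find(j)

    clusters = {}
    for i, pair in enumerate(pairs):
        clusters.setdefault(find(i), []).append(pair)

    blocks = []
    for members in clusters.values():
        genes_a = {(sa, ia) for sa, ia, _, _ in members}
        genes_b = {(sb, ib) for _, _, sb, ib in members}
        # Keep only blocks with at least min_genes orthologues per species.
        if len(genes_a) >= min_genes and len(genes_b) >= min_genes:
            blocks.append(sorted(members))
    return sorted(blocks)


example_pairs = [
    ("a1", 0, "b1", 5),
    ("a1", 3, "b1", 9),
    ("a1", 8, "b1", 2),
    ("a1", 40, "b1", 60),  # too far from the others to join the block
    ("a2", 1, "b1", 6),    # different scaffold in species A
]
blocks = find_microsyntenic_blocks(example_pairs)
```

The pairwise comparison is quadratic in the number of orthologue pairs, which keeps the sketch short; a genome-scale analysis would presumably sort genes by position first.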

Intron and indel identification and phylogeny. Gene families with a maximum 
of 2 missing species (out of 22) were included. Intron and indel positions were 
detected using conserved flanking sites (3 out of 8 amino acids); no gaps were 
allowed to flank introns. For indels, the flanking amino acids had to be conserved. 
Phylogenetic inference based on presence or absence was computed with MrBayes 
as described in Supplementary Note 5.3. 


Rapid regulation of depression-related behaviours 
by control of midbrain dopamine neurons 

Ventral tegmental area (VTA) dopamine neurons in the brain’s reward 
circuit have a crucial role in mediating stress responses’ *, including 
determining susceptibility versus resilience to social-stress-induced 
behavioural abnormalities’. VTA dopamine neurons show two in vivo 
patterns of firing: low frequency tonic firing and high frequency 
phasic firing®*. Phasic firing of the neurons, which is well known to 
encode reward signals®”’, is upregulated by repeated social-defeat 
stress, a highly validated mouse model of depression®*"*". Surpri- 
singly, this pathophysiological effect is seen in susceptible mice only, 
with no apparent change in firing rate in resilient individuals”*. 
However, direct, real-time evidence linking dopamine neuron 
phasic firing to promotion of the susceptible (depression-like) phenotype 
is lacking. Here we took advantage of the temporal precision and 
cell-type and projection-pathway specificity of optogenetics to show that 
enhanced phasic firing of these neurons mediates susceptibility to 
social-defeat stress in freely behaving mice. We show that optogenetic 
induction of phasic, but not tonic, firing in VTA dopamine neurons 
of mice undergoing a subthreshold social-defeat paradigm rapidly 
induced a susceptible phenotype as measured by social avoidance 
and decreased sucrose preference. Optogenetic phasic stimulation of 
these neurons also quickly induced a susceptible phenotype in previ- 
ously resilient mice that had been subjected to repeated social-defeat 
stress. Furthermore, we show differences in projection-pathway spe- 
cificity in promoting stress susceptibility: phasic activation of VTA 
neurons projecting to the nucleus accumbens (NAc), but not to the 
medial prefrontal cortex (mPFC), induced susceptibility to social- 
defeat stress. Conversely, optogenetic inhibition of the VTA-NAc 
projection induced resilience, whereas inhibition of the VTA-mPFC 
projection promoted susceptibility. Overall, these studies reveal novel 
firing-pattern- and neural-circuit-specific mechanisms of depression. 

To selectively target VTA dopamine neurons, we injected a double-floxed 
(DIO) Cre-dependent adeno-associated virus (AAV) vector 
expressing channelrhodopsin-2 (ChR2) fused with enhanced yellow 
fluorescent protein (eYFP) (AAV-DIO-ChR2-eYFP) into the VTA of 
tyrosine hydroxylase (TH)-Cre transgenic mice”"*. We first validated 
the specificity and efficacy of AAV-DIO-ChR2-eYFP expression in 
VTA dopamine neurons of TH-Cre mice in vivo (Fig. 1a, b). Next, 
functional validation of ChR2-eYFP expression in VTA dopamine 
neurons using in vitro and in vivo electrophysiological recordings 
confirmed that optogenetic stimulation enables precise temporal control 
of these neurons (Supplementary Fig. 1a-c) and can be used to 
mimic the in vivo pathophysiological upregulation in phasic firing seen 
in the social-defeat-stress model of depression*””. 

Figure 1 | Phasic, but not tonic, optical stimulation of VTA dopamine 
neurons during a subthreshold social defeat induces a susceptible 
phenotype. a, Confocal image showing co-expression of AAV-DIO-ChR2 in 
TH-positive dopamine cells from TH-Cre mice. b, Quantification shows that 
ChR2-expressing TH-positive cells are 62 ± 4% of total TH-positive neurons in 
the VTA. No expression of ChR2 was found in TH-negative neurons 
(n = 2-3 sections from n = 4 animals). c, Optical stimulation protocols for 
mimicking tonic (0.5-Hz) or phasic (20-Hz) firing. Note that for both stimulating 
protocols, 5 spikes are induced over each 10-s period. d, Top panel, experimental 
timeline. Bottom panel, detailed schematic of the subthreshold paradigm showing 
laser stimulation during social defeat. e, Social-interaction data from control, tonic 
and phasic groups (F2,39 = 4.70, P < 0.05; post hoc test, *P < 0.05; n = 7-14). 
f, Sucrose preference measured over a 12-h period after the social-interaction test 
(F522 = 5.22, P < 0.05; post hoc test, *P < 0.05; n = 7-10). Error bars, ± s.e.m. 

To investigate the functional consequence of the increase in phasic 
firing of VTA dopamine neurons observed after repeated (10-day) 
social-defeat stress, we examined the effect of optogenetically inducing 
tonic (0.5-Hz) or phasic (20-Hz) firing (Fig. 1c; 5 spikes for each 10 s) 
in ChR2-expressing VTA dopamine neurons of TH-Cre mice while 
undergoing subthreshold exposure to social defeat (Fig. 1d). Mice that 
undergo this subthreshold paradigm do not exhibit social avoidance or 
other depression-like behaviours (Supplementary Fig. 1d), but they are 
more vulnerable to subsequent stress*’*. We stimulated VTA dopa- 
mine neurons during the subthreshold defeat paradigm and measured 
social interaction and sucrose preference as two sequelae of defeat 
stress: social avoidance and reduced sucrose preference characterize 
the susceptible phenotype induced by our repeated (10-day) social- 
defeat paradigm**"°. Mice that received phasic stimulation exhibited a 
robust increase in the depression-like phenotype as indicated by the 
significant decrease both in social interaction in the presence of a target 
mouse (Fig. 1e and Supplementary Fig. 1e-g) and in sucrose preference 
(Fig. 1f) compared to both tonic stimulated ChR2-eYFP mice 
and phasic stimulated eYFP control mice. These data confirm the 
functional importance of increased phasic, but not tonic, firing of 
VTA dopamine neurons during exposure to stress for promoting sus- 
ceptibility for depression-like behavioural abnormalities. 
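
The two stimulation protocols compared above deliver the same number of light pulses (5 per 10-s period) and differ only in how the pulses are spaced. The following Python sketch generates the pulse onset times for each protocol. It is illustrative only: the function name is hypothetical, the phasic burst is assumed to start at the beginning of each period, and pulse width and laser power are not modelled because they are not given in this section.

```python
def pulse_times(frequency_hz, n_periods=1, pulses_per_period=5, period_s=10.0):
    """Return pulse onset times (s) for a train of evenly spaced pulses.

    Within each period, pulses_per_period pulses are delivered at
    frequency_hz, starting at the beginning of the period (an assumption
    made for this sketch).
    """
    interval = 1.0 / frequency_hz
    if (pulses_per_period - 1) * interval >= period_s:
        raise ValueError("pulse train does not fit inside one period")
    times = []
    for k in range(n_periods):
        start = k * period_s
        # Round to microseconds to avoid floating-point noise in the output.
        times.extend(round(start + i * interval, 6) for i in range(pulses_per_period))
    return times


tonic = pulse_times(0.5)    # one pulse every 2 s
phasic = pulse_times(20.0)  # a 5-pulse burst with 50-ms spacing
```

Both lists contain five onsets per 10-s period, so any behavioural difference between the groups reflects the temporal pattern of stimulation rather than the total number of evoked spikes.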

To test directly the causal link between phasic dopamine-neuron 
firing and stress susceptibility, mice were exposed to subthreshold 
defeat and then to stimulation of VTA dopamine neurons during 
the social-interaction test (Supplementary Fig. 2a). Phasic stimulation 
of VTA dopamine neurons instantly induced a susceptible phenotype 
(increased social avoidance) during the social-interaction test in the 
presence of a target CD1 mouse (or a target C57 mouse, Supplemen- 
tary Fig. 3), an effect not seen in tonic stimulated ChR2 mice or phasic 
stimulated eYFP control mice (Fig. 2a and Supplementary Fig. 2b-d). 
ChR2-expressing mice that received phasic stimulation during the 
brief 2.5-min social-interaction test also showed reduced sucrose pref- 
erence compared to tonic-stimulated ChR2 mice or phasic-stimulated 
eYFP control mice (Fig. 2b). These findings are striking because they 
reveal the rapid induction of such depression-like behaviours that 
normally require repeated social-defeat stress. 

In contrast to the ability of VTA dopamine-neuron stimulation to 
promote depression-like behaviours either during or after social-defeat 
stress, we found that phasic stimulation of these neurons in naive animals 
had no effect on social interaction, on sucrose preference or on baseline 
anxiety-related measures (Supplementary Fig. 4). These findings show 
that the pro-depression-like effects of VTA dopamine-neuron activation 
demonstrated here are context-specific, an important finding in light of 
the pro-reward consequences of such dopamine-neuron activation in 
other systems®””. Furthermore, chronic mild stressors or physically aver- 
sive stimuli inhibit the activity of VTA dopamine neurons, whereas more 
severe stressors increase this activity'*”’, as is the case with severe social 
stress. Also, chronic mild stress and chronic social-defeat stress have been 
shown recently to produce different changes in extracellular levels of se- 
veral neurotransmitters in a number of brain areas’®. These findings raise 
the possibility that, in addition to context, the severity of stress is another 
important determinant of stress regulation of dopamine-neuron firing. 

We next investigated whether phasic firing of VTA dopamine neu- 
rons could convert resilient mice to susceptible mice. Using our stan- 
dard repeated (10-day) social-defeat paradigm**", we identified the 
subgroup of mice that remained resilient based on normal social- 
interaction scores, which we know are highly correlated with normal 
sucrose preference and other behavioural and biochemical endpoints’. 
Our previous work had shown that VTA dopamine neurons of resilient 
mice display normal firing rates and patterns due to unique adapta- 
tions within these neurons that prevent the stress-induced increase in 
their excitability**. To test the effect of increased phasic stimulation of 
VTA dopamine neurons in resilient mice, we switched to herpes sim- 
plex virus (HSV)-ChR2 and control vectors with confirmed immuno- 
histochemical and functional validation (Supplementary Fig. 5a-d). 

Figure 2 | Phasic optical stimulation of VTA dopamine neurons during the 
social-interaction test instantly induces a susceptible phenotype in two 
social-defeat paradigms. a, Social-interaction data in control, tonic and phasic 
groups (F2,2. = 4.00, P < 0.05; post hoc test, *P < 0.05; n = 7-11). Blue light 
stimulation was used during the tests (blue horizontal bars in a and c). 
b, Sucrose preference measured over a 12-h period after the 
social-interaction test (F,5 = 3.47, P < 0.05; post hoc test, *P < 0.05; 
n = 8-11). c, Social-interaction data measured on day 17 (t)5 = 2.72, *P < 0.05; 
two-tailed t-test, n = 11-18). d, Sucrose preference measured over a 12-h 
period after the social-interaction test (t)7 = 2.34, *P < 0.05; two-tailed t-test, 
n = 6-12). e, Sample traces showing in vitro spontaneous activity of VTA 
dopamine neurons from TH-Cre mice that underwent tonic and phasic 
stimulation during the social-interaction test 24 h after subthreshold social 
defeat (see Supplementary Fig. 5j for the experimental timeline). Bar graph, 
comparison of spontaneous firing in VTA dopamine neurons from eYFP 
control, tonic and phasic-stimulated mice (F359 = 3.19, P < 0.05; post hoc test, 
*P < 0.05; n = 17-19). f, Significantly less current was required to evoke a 
single spike in phasic-stimulated mice compared to eYFP control mice 
(t21 = 1.8, *P < 0.05; one-tailed t-test, n = 12-16). g, VTA dopamine 
cells from phasic-stimulated mice display greater overall increased cell 
excitability in response to incrementally increased current injections 
(50, 100, 150 and 200 pA) than eYFP control and tonic-stimulated mice 
(F549 = 16.13, P < 0.001; post hoc test, *P < 0.05 **P < 0.005; n = 5-17). 
Error bars, ± s.e.m. 

HSV vectors have the advantage of expressing their transgenes very 
rapidly (within 24 h) compared to AAV vectors, which require 10-14 days 
for maximal expression*'*"*. In this experiment, wild-type 
mice were subjected to the standard repeated social-defeat stress 
and then a social-interaction test on day 11 to identify the resilient 
mice as noted above. Resilient mice were injected with HSV-eYFP or 
HSV-ChR2-eYFP and subsequently analysed in a second social- 
interaction test during which time they underwent optogenetic phasic 
stimulation (Supplementary Fig. 5e, f). Although optogenetically 
stimulated HSV-eYFP-injected mice remained resilient, optically 
induced phasic firing of the VTA of HSV-ChR2-eYFP injected mice 
instantly converted their behavioural phenotype from resilient to sus- 
ceptible as evidenced by the decrease in time spent in the interaction 
zone (Fig. 2c and Supplementary Fig. 5g-i). Furthermore, such phasic 
firing of the VTA induced anhedonic traits as evidenced by decreased 
sucrose preference compared to HSV-eYFP-injected mice (Fig. 2d). 
Given the striking impairment in sucrose preference approximately 
12 h after the optogenetic activation of VTA neurons, we proposed that 
such stimulation might induce lasting changes in VTA dopamine- 
neuron excitability. To investigate this hypothesis, we measured intrin- 
sic membrane properties of these neurons of TH-Cre mice that had 
previously undergone the subthreshold social-defeat paradigm fol- 
lowed by in vivo optogenetic stimulation during the social-interaction 
test (Supplementary Fig. 5j). Phasic stimulation induced increased 
VTA dopamine-neuronal excitability compared to eYFP and tonic- 
stimulated mice as measured by increased spontaneous (Fig. 2e) and 
evoked (Fig. 2f, g) activity 8-12 h after optical stimulation. These find- 
ings suggest that subthreshold defeat followed by acute optogenetic 
activation leads to long-lasting neuroadaptations in VTA dopamine neu- 
rons that underlie the sustained decrease in sucrose preference observed. 
VTA dopamine neurons project broadly throughout the brain. The 
role of the VTA-NAc pathway in reward is well known®”, but it has 
also been implicated in stress responses, as has the VTA—mPFC path- 
way'**!°!9. Therefore we were interested in investigating the role of 
these two distinct VTA projection pathways in promoting the suscep- 
tible and resilient phenotypes. We first investigated the VTA-NAc 
circuit by specifically labelling VTA neurons projecting to the NAc; 
this was accomplished by injecting the retrograde green fluorescent 
tracer lumafluor into the NAc and measuring the firing rate of 
dye-positive VTA-NAc neurons in control, susceptible and resilient 
mice after repeated (10-day) social-defeat stress (see Supplementary 
Fig. 6a, b for anatomical validation). We found that NAc-projecting 
VTA dopamine neurons in brain slices from susceptible mice showed 
a significantly higher firing rate compared to those of control and 
resilient mice (Fig. 3a, b), which is in agreement with increased phasic 
firing events from our previous work**. Next we used optogenetic 
techniques to selectively stimulate this VTA—NAc pathway. To do this, 
we injected a replication-defective version of the retrograde travelling 
pseudorabies virus expressing Cre (PRV-Cre) into the NAc and the 
Cre-dependent AAV-DIO-ChR2-eYFP into the VTA of wild-type 
mice (Supplementary Fig. 6c). Immunohistochemical and electro- 
physiological validation confirmed the viability of using PRV-Cre to 
express functional ChR2 in VTA cells projecting to the NAc 
(Supplementary Fig. 6d-h). Mice were then subjected to the subthreshold 
social-defeat paradigm and 24 h later were optogenetically stimulated 
during the social-interaction test (Supplementary Fig. 6i). Optical 
induction of phasic, but not tonic, firing in VTA-NAc neurons induced the 
susceptible phenotype as measured by increased social avoidance and 
decreased sucrose preference (Fig. 3c, d and Supplementary Fig. 6j, k). 
Although the vast majority of dopamine neurons in this pathway were 
optically stimulated, only a small number of non-dopamine neurons 
were labelled with our approaches. Further work will be needed to study 
any potential influence of these non-dopamine cells on stress responses. 
To investigate the effect of inhibiting the VTA-NAc pathway on the 
expression of the stress response, we injected AAV-DIO-halorhodopsin 
(NpHR)-eYFP (version 3.0 of NpHR) into the VTA followed by PRV- 
Cre in the NAc (Supplementary Fig. 7a, b). Functional validation con- 
firmed that yellow light (563 nm) reliably inhibited VTA neuronal 
firing both in vitro and in vivo (Supplementary Fig. 7c, d) and, impor- 
tantly, we did not observe any NpHR-activated rebound effect in these 
neurons as a result of the light protocol that we used during the behavioural 
experiments (Supplementary Fig. 7e). Furthermore, anatomical 
quantification showed robust NpHR expression in VTA cells projecting 
to NAc (Supplementary Fig. 7f, g). Mice were first put through the 
standard repeated social-defeat stress paradigm followed by a social- 
interaction test on day 11 to identify the susceptible mice for eYFP 
control and NpHR groups (Supplementary Fig. 7h). Optogene- 
tically stimulated DIO-eYFP injected mice remained susceptible in 
the second social-interaction test (both during target presence and 
absence), whereas inhibition of the VTA-NAc projection of DIO- 
NpHR-eYFP-injected mice instantly induced the resilient phenotype 
as evidenced by the increase in both the amount of time spent in the 
interaction zone and preference for sucrose (Fig. 3e, f and Supplemen- 
tary Fig. 7i, j). These mice had undergone a chronic social-defeat para- 
digm and an earlier social-interaction test. It is therefore likely that the 
optogenetic attenuation of the increased activity of the VTA-NAc 
pathway that is typically seen in susceptible mice, combined with reacti- 
vation of the memory of the previous social-interaction test, resulted in 
those NpHR-injected mice spending longer times in the interaction 
zone, both in the absence and presence of a target mouse, during the 
second social-interaction test. 

Figure 3 | Bidirectional effect of modulating the VTA-NAc pathway on 
susceptibility to social defeat. a, Sample traces recorded from VTA-NAc 
neurons in VTA slices. b, Firing rates of VTA-NAc neurons from control, 
resilient and susceptible mice (F2,g9 = 15.77, P < 0.001; post hoc test, 
*P < 0.001; n = 12-52). c, Social-interaction data obtained during optical 
stimulation of VTA-NAc neurons in control, tonic and phasic groups 
(F2,29 = 4.43, P < 0.05; post hoc test, *P < 0.05; n = 5-10). Blue light 
stimulation was used during the tests (blue horizontal bars). d, Sucrose- 
preference data measured over a 12-h period after the social-interaction test 
(F2,13 = 4.80, P < 0.05; post hoc test, *P < 0.05; n = 5-9). e, Social interaction 
during optical inhibition of VTA-NAc neurons in previously susceptible mice 
(No target: t); = 4.2, *P < 0.001; two-tailed t-test, n = 8-9; Target: t; = 2.6, 
*P < 0.05; two-tailed t-test, n = 8-9). Yellow light stimulation was used during 
the tests (yellow horizontal bars). f, Sucrose preference measured over a 12-h 
period after the social-interaction test (t,4 = 2.3, *P < 0.05; two-tailed t-test, 
n = 8-9). Error bars, ± s.e.m. 

We also carried out experiments to investigate the functional im- 
portance of mPFC-projecting VTA dopamine neurons in promoting a 
susceptible phenotype (Fig. 4). We first labelled VTA neurons projec- 
ting to mPFC by injecting the retrograde red fluorescent tracer luma- 
fluor into the mPFC (Supplementary Fig. 8a, b). Surprisingly, the firing 
rate of VTA-mPFC neurons was dramatically decreased in brain slices 
obtained from susceptible mice after repeated (10-day) social-defeat 
stress, compared to that of control and resilient mice (Fig. 4a, b). This is 
consistent with a recent report of decreased extracellular dopamine 
levels in this region after repeated social-defeat stress’*. We next used 
optogenetic techniques to selectively stimulate the VTA-mPFC pathway. 
To do this, we injected the retrograde PRV-Cre viral vector into 
the mPFC and the Cre-dependent AAV-DIO-ChR2-eYFP into the 
VTA of wild-type mice (Supplementary Fig. 8c), and confirmed the 
viability of using PRV-Cre to express functional ChR2 in VTA cells 
projecting to mPFC (Supplementary Fig. 8d-h). Optically induced pha- 
sic firing of these VTA-mPFC neurons, during the social-interaction 
test in mice that had previously undergone subthreshold social defeat, 
had no effect on social interaction and sucrose preference, compared to 
tonic stimulation and control viral vectors (Fig. 4c, d and Supplemen- 
tary Fig. 8i, j). We next investigated the effect of inhibiting the VTA- 
mPFC pathway on the expression of the stress response by injecting 
DIO-NpHR-eYFP into the VTA followed by PRV-Cre into the mPFC 
(Supplementary Fig. 9a-d). In vivo optical inhibition of the VTA-mPFC 
pathway by activation of NpHR, during the social-interaction 
test in mice that had previously undergone subthreshold social defeat, 
induced the susceptible phenotype as measured by decreased social 
interaction, an effect that is not seen in control mice (Fig. 4e and Sup- 
plementary Fig. 9e, f). Interestingly, there was no difference in sucrose 
preference (Fig. 4f). 

Figure 4 | Effect of modulating the VTA-mPFC pathway on susceptibility 
to social defeat. a, Sample traces recorded from VTA-mPFC neurons in VTA 
slices. b, Firing rates of VTA-mPFC neurons from control, resilient and 
susceptible mice (F2,33 = 15.07, P = 0.0005; post hoc test, *P < 0.005; n = 11- 
15). c, Social-interaction data obtained by optical stimulation of VTA-mPFC 
neurons in control, tonic and phasic groups (F2,39 = 1.29, P = 0.29; n = 11-17). 
d, Sucrose-preference data measured over a 12-h period after the social- 
interaction test (F2,33 = 0.19, P = 0.82; n = 10-16). e, Social-interaction data 
obtained during optical inhibition of VTA-mPFC neurons (ty9 = 2.5, 
*P < 0.05; two-tailed t-test, n = 12-19). f, Sucrose preference measured over a 
12-h period after the social-interaction test (ty) = 0.38, P > 0.05, n = 12-19). 
Error bars, ± s.e.m. 

Our study establishes a direct link between VTA dopamine-neuronal 
firing patterns and susceptibility to a depression-related phenotype, in 
cases in which phasic firing of NAc-projecting VTA dopamine neurons 
encodes a signal for susceptibility. This finding is consistent with the 
proposed role of VTA-NAc (mesolimbic) dopamine-neuronal phasic 
firing in this stress model: we previously showed that the VTA-NAc 
pathway is a key determinant of susceptibility versus resilience to 
repeated social-defeat stress*, and that the pathophysiological increase 
in phasic firing occurs selectively in susceptible mice®. The observation 
that VTA dopamine-neuron phasic firing has a functional role in 
encoding for both depression-like symptoms after social-defeat stress, 
and conditioned place preference’, suggests that firing patterns of these 
neurons and the subsequent encoding of the depressive or reward 
phenotype are highly context-dependent. Such encoding of stress ver- 
sus reward is more complex still, because it is not only firing-pattern 
selective’ but also dependent on the severity of stress (as noted above) 
and shows clear projection-pathway specificity”. 

In addition, our observation of the rapid onset of the susceptible 
phenotype in response to optical stimulation corresponds to the in- 
ferred function of mesolimbic dopamine neurons in mediating rapid 
antidepressant effects'”®. Similarly, hyperpolarization-activated cation 
channels (Ih), a key channel responsible for the transition from tonic to 
phasic firing of VTA dopamine neurons*'” that is highly expressed in 
VTA-NAc neurons”, is increased in susceptible mice, whereas Ih 
inhibitors display rapid and long-lasting antidepressant-like efficacy*”’. 
Furthermore, the rapid antidepressant effects of ketamine*~*, sleep 
deprivation’’ and deep brain stimulation***° support the existence of 
rapidly reversible brain mechanisms and support the idea of regulating 
neural circuits to treat depression. Consistent with these studies, we 
show a rapid ‘rescue’ effect of VTA-NAc pathway inhibition when it 
occurs in the context of repeated severe social stress. In marked contrast, 
our finding that the VTA-mPFC pathway serves the opposite function 
to the VTA-NAc pathway is consistent with a recent study 
demonstrating the differential roles of largely distinct VTA 
dopamine-neuron populations in response to rewarding versus aversive stimuli’. In 
addition, our observation that modulation of the VTA-NAc pathway, 
but not the VTA-mPFC pathway, induces anhedonic-like effects 
suggests a functional role for the VTA-NAc pathway, but not the VTA-mPFC 
pathway, in encoding reward-related information in the context of 
depression. Our projection-specific findings thus provide new insights 
into the complex role that VTA dopamine neurons have in an indivi- 
dual’s adaptations to repeated stress and the development of depression- 
like behavioural abnormalities. 


METHODS SUMMARY 

Viral-vector delivery to discrete brain regions. Viral vectors (0.5 µl) were 
stereotaxically and bilaterally injected at a rate of 0.1 µl min⁻¹. AAV-DIO-ChR2-eYFP 
and AAV-eYFP (purchased from the University of North Carolina Vector Core 
Facility) were delivered into the VTA of TH-Cre mice. Retrograde PRV-Cre was 
injected into either NAc or mPFC and then AAV-DIO-ChR2-eYFP or AAV- 
DIO-eYFP was injected into the VTA. All viral vectors used were functionally 
validated with electrophysiological approaches in brain slices and/or intact 
animals. Retrograde pseudorabies virus (PRV)-Cre and PRV-GFP have a deletion in 
the viral thymidine kinase gene so that they do not replicate in non-dividing cells, 
cannot be transmitted trans-synaptically, and do not produce detectable effects on 
the health of infected neurons. Injected animals remained healthy when brain 
tissues were taken 7 to 10 days after surgery. 

Social-defeat stress and social-interaction test. The subthreshold social-defeat 
paradigm involved placing the test C57BL/6J mouse into the home cage of a larger 
retired breeder mouse (CD1) for 2 min during which time the experimental mouse 
was physically attacked by the CD1 mouse. After two bouts of defeat the mouse 
was returned to its home cage. Mice underwent the social-interaction test 24 h 
after the defeat episodes. Mice received laser stimulation either during the sub- 
threshold defeat encounters or during the subsequent social-interaction test. For 
some experiments in Figs 2 and 3, a repeated social-defeat paradigm (5-10 min 
defeat per day for 10 days) was used to separate out susceptible and resilient mice. 
The segregation of susceptible and resilient mice was based on the social-interaction 
test. These subthreshold and repeated social-defeat paradigms have been exten- 
sively validated**””. 

Light stimulation. For in vivo behavioural experiments, mice expressing ChR2 
(473 nm) were given low-frequency tonic (0.5 Hz) or high-frequency 
phasic (20 Hz) light stimulation. In both tonic and phasic light-stimulation 
protocols, VTA dopamine neurons were exposed to 5 spikes over each 10-s period. 
For in vivo behavioural experiments, mice expressing NpHR (563 nm) were 
given 8 s of light on followed by 2 s of light off. 
Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 

Experimental subjects. CD1 male retired breeders (Charles River), C57BL/6J 
male mice (7-9 weeks; Jackson Laboratories) and tyrosine hydroxylase (TH)-Cre 
male transgenic mice (7-9 weeks; Charles River) were used in these studies. 
TH-Cre transgenic mice (EM:00254) were obtained from the European Mutant 
Archive and mated with C57BL/6 wild-type mice”*’. All TH-Cre subjects used in 
this study were backcrossed for at least 10 generations. Mice were singly housed 
and maintained on a 12-h light-dark cycle with food and water available ad 
libitum. All animal protocols were in accordance with the National Institutes of 
Health Guide for Care and Use of Laboratory Animals and approved by the Mount 
Sinai Institutional Animal Care and Use Committee. 

Virus vectors. AAV-DIO-ChR2-eYFP, AAV-DIO-NpHR-eYFP and AAV- 
DIO-eYFP virus plasmids* were initially generated using previous protocols” 
and then purchased from the University of North Carolina vector core facility 
(UNC). HSV-ChR2-eYFP and HSV-eYFP were provided by the laboratory of 
R.L.N. (MIT). Retrograde pseudorabies virus (PRV)-Cre was engineered by the 
laboratory of J.M.F. (Rockefeller University). 

Stereotaxic surgery, viral mediated gene transfer, cannula placement and 
optic-fibre placement. Mice were anaesthetized with a ketamine (100 mg kg⁻¹) 
and xylazine (10 mg kg⁻¹) mixture, placed in a stereotaxic apparatus (Kopf 
Instruments) and their skull was exposed by scalpel incision. For ChR2 viruses, 
33-gauge needles were placed either bilaterally or unilaterally at a 0° angle into the 
VTA (anterior-posterior, -3.3 mm; lateral-medial, +0.5 mm; dorsal-ventral, 
-4.4 mm) and 0.5-1 µl of virus was infused at a rate of 0.1 µl min⁻¹. NpHR virus 
was injected bilaterally into the VTA at a 7° angle (anterior-posterior, -3.3 mm; 
lateral-medial, +1.05 mm; dorsal-ventral, -4.6 mm) and 1 µl of virus was infused 
at a rate of 0.1 µl min⁻¹. For PRV-Cre viral injections, 33-gauge needles were 
placed bilaterally at either a 10° angle into the NAc (anterior-posterior, 
+1.6 mm; lateral-medial, +1.5 mm; dorsal-ventral, -4.4 mm) or a 15° angle into 
the mPFC (anterior-posterior, +1.7 mm; lateral-medial, +0.75 mm; dorsal-ventral, 
-2.5 mm) and 0.5-1 µl of virus was infused at a rate of 0.1 µl min⁻¹. Bilateral (26-gauge) or 
unilateral (20-gauge) cannulae, with a length of 3.9 mm from the cannula base, 
were implanted over the VTA (anterior-posterior, -3.3 mm; lateral-medial, 
+0.5 mm; dorsal-ventral, -3.7 mm). For secure fixture of the cannula to the skull, 
instant adhesive (Loctite 454) was placed between the base of the cannula and the 
skull, and then dental cement (Ortho-Jet) was added around the cannula and the 
skull. For NpHR-expressing viral vectors, we used the chronically implantable 
optical-fibre system**. Chronically implantable fibres were homemade with 
200-µm core optic fibre and were implanted into the VTA at a 7° angle 
(anterior-posterior, -3.3 mm; lateral-medial, +1.05 mm; dorsal-ventral, -4.4 mm). For secure 
fixture of the implantable fibre to the skull, the skull was dried and then industrial- 
strength dental cement (Grip cement; Dentsply) was added between the base of the 
implantable fibre and the skull. For AAV surgeries on TH-Cre mice (cell-specific 
study), we carried out two surgeries in which we first injected the ChR2 virus then 
performed cannula surgery at least 2 weeks later when the AAV virus was expressed. 
Mice were allowed to recover for at least 3 days before starting the behavioural 
paradigm. For AAV surgeries on C57 mice (projection-specific study) we again 
undertook two surgeries, in which ChR2 virus surgery was carried out and then 
PRV-Cre and cannula or implantable-fibre surgery was carried out at least 2 weeks 
later. To enable the mice to recover and to allow PRV-Cre to be transported 
retrogradely to the VTA, we waited 5 days before starting the behavioural paradigm. For HSV surgeries both 
ChR2 virus and cannula placement were carried out on the same day and mice were 
allowed to recover for at least 3 days before starting the behavioural paradigm. 
For Lumafluor tracing (retrograde labelling of VTA neurons projecting to the NAc or 
mPFC), the retrobeads were injected into the NAc or mPFC at the coordinates 
mentioned previously. Lumafluor (1 µl) was injected into each hemisphere. 

For in vivo optical control of VTA neuronal firing with ChR2 experiments, a 
200-µm core optic fibre was modified for attachment to the cannula. When the 
fibre was secured to the cannula, the tip of the fibre extended approximately 
0.5 mm beyond the cannula. This experimental set-up was based on, and slightly 
modified from, previous studies'**°. For NpHR experiments, a 200-µm fibre core was 
attached to a chronically implanted fibre. Optic fibres were only secured in vivo 
during the behavioural paradigm (during subthreshold social-defeat or social- 
interaction tests). 

Blue light stimulation. Optical fibres (Thor Labs, BFL37-200) were connected 
using an FC/PC adaptor to a 473-nm blue laser diode (Crystal Laser, BCL-473-050-M), 
and a stimulator (Agilent Technologies, no. 33220A) was used to generate 
blue light pulses. For in vitro slice electrophysiological validation of ChR2 activa- 
tion we tested 0.1-50-Hz stimulation protocols. For in vivo optrode recordings we 
tested similar stimulation protocols. For all in vivo behavioural experiments, mice 
were given either low-frequency tonic (0.5 Hz, 15 ms) or high-frequency phasic 
(20 Hz, 40 ms) light stimulations. In both tonic and phasic light stimulation pro- 
tocols, VTA dopamine neurons were exposed to 5 spikes over each 10-s period 
(Fig. 1c). For ChR2 activation during the subthreshold social defeat, mice received 
bilateral stimulation, whereas for all ChR2 activation during the social-interaction 
test, mice received unilateral stimulation. 
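
The equivalence of the two protocols (5 pulses per 10-s period, delivered at either 0.5 Hz or 20 Hz) can be illustrated with a minimal Python sketch. The function name `pulse_onsets`, its parameters and the placement of each pulse group at the start of the period are assumptions made for illustration; they do not describe the stimulator settings actually used.

```python
def pulse_onsets(freq_hz, n_pulses=5, period_s=10.0, n_periods=1):
    """Return light-pulse onset times in seconds.

    Assumption: each group of n_pulses starts at the beginning of a
    period and pulses within the group are spaced by 1 / freq_hz.
    """
    interval = 1.0 / freq_hz
    return [round(p * period_s + i * interval, 6)
            for p in range(n_periods)
            for i in range(n_pulses)]


# Tonic protocol: 0.5 Hz, so the 5 pulses are spread evenly over 10 s.
tonic = pulse_onsets(0.5)
# Phasic protocol: 20 Hz, so the 5 pulses form one short burst per 10 s.
phasic = pulse_onsets(20.0)

print("tonic onsets (s): ", tonic)
print("phasic onsets (s):", phasic)
```

Both lists contain five onsets per period; only their spacing differs, which is the property the tonic-versus-phasic comparison relies on.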

Yellow light stimulation. Optical fibres (Thor Labs, SFS200/220Y) were con- 
nected using an FC/PC adaptor to a 561-nm yellow laser diode (Crystal Laser, 
CL561-050-L), and a stimulator (Agilent Technologies, no. 33220A) was used to 
generate yellow light pulses. For in vitro slice electrophysiological validation of 
NpHR activation, we tested different durations of yellow light activation (1-60 s 
duration) to examine the duration-dependent inhibition of the firing rate 
(Supplementary Fig. 7d). For all in vivo behavioural experiments, mice were given a 
protocol of 8 s of yellow light on followed by 2 s of light off during the social-interaction 
test. All NpHR experiments were performed using bilateral stimulation. 

Social-defeat stress and social interaction. Mice underwent either a subthres- 
hold social-defeat stress paradigm or a repeated social-defeat stress paradigm’”’. 
The subthreshold paradigm involved placing the test mouse (intruder) into the 
home cage of a larger, retired ‘resident’ breeder mouse (CD1) for 2 min during 
which time the experimental mouse was physically attacked (defeated) by the CD1 
mouse. After 2 min of physical interaction, the experimental mouse underwent 
10 min of sensory stress. For this, a perforated plexiglass partition was placed in 
the middle of the CD1 mouse home cage, and the resident CD1 and intruder 
experimental mouse were physically separated but kept next to each other. After 
10 min of sensory stress the experimental mouse was returned to its home cage for 
5 min after which time it went through a second round of physical interaction and 
sensory stress in the home cage of a new CD1 mouse. After two bouts of defeat the 
experimental mouse was returned to its home cage and underwent the social-interaction 
test the next day. In a subset of experiments the experimental mice underwent 
bilateral light stimulation (5 min per hemisphere), in order to stimulate VTA dopa- 
mine cells, during the sensory-stress period. 

For the repeated social-defeat paradigm, an experimental mouse was placed 
into the home cage of a CD1 mouse for 10 min during which time it was physically 
defeated by the CD1 mouse. After 10 min of physical interaction, the CD1 and 
experimental mouse were maintained in sensory contact for 24 h using a 
perforated plexiglass partition dividing the resident home cage in two. The experimental 
mice were exposed to a new CD1 mouse home cage for 10 consecutive days and 
tested for social interaction on day 11. Both subthreshold and repeated social 
defeat were carried out between 13:00 and 15:00. 

Social-avoidance behaviour towards a novel CD1 mouse was measured in a 
two-stage social-interaction test. In the first 2.5-min test (target absent), the 
experimental mouse was allowed to freely explore a square-shaped arena (44 × 44 cm) 
containing a wire-mesh cage (10 × 6 cm) placed on one side of the arena. In the 
second 2.5-min test (target present), the experimental mouse was reintroduced into the arena 
with an unfamiliar CD1 mouse contained behind a wire-mesh cage. Video-tracking 
software (Ethovision 3.0, Noldus) was used to measure the amount of time the 
experimental mouse spent in the ‘interaction zone’ (14 × 26 cm) and ‘corner zone’ 
(10 × 10 cm), and ‘total travel’ within the arena. In a subset of studies experimental 
mice underwent unilateral blue light stimulation to activate the VTA dopamine 
cells or yellow light stimulation to inhibit VTA dopamine cells, during both the 
target absent and target present sessions of the social-interaction test. The inter- 
action ratio was measured as (interaction time, target present)/(interaction time, 
target absent) and normalized to 100. Susceptible and resilient mice were 
segregated based on the interaction ratio: mice with scores <100 were defined as 
‘susceptible’ and those with scores ≥100 were defined as ‘resilient’. To confirm 
that the optogenetic activation-induced depression-related avoidance is not a 
simple fear memory of a CD1 aggressor, a social-interaction test using a C57 mouse 
as a target was carried out. Mice that had undergone subthreshold social defeat and 
then phasic stimulation during the social-interaction test similarly avoided the 
non-aggressive C57 social target to the same degree as observed with a CD1 target 
(Supplementary Fig. 2). This is consistent with our previous work showing that a 
C57 social target induced social avoidance in the repeated social-defeat paradigm’. 
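
The interaction ratio and the susceptible/resilient split described above can be written as a short Python sketch. The function names and the example times are hypothetical, and treating a score of exactly 100 as resilient is an assumption about the boundary case rather than a detail confirmed by the text.

```python
def interaction_ratio(time_present_s, time_absent_s):
    """Interaction ratio scaled so that 100 means equal time in the
    interaction zone with the target present and absent."""
    if time_absent_s <= 0:
        raise ValueError("time with target absent must be positive")
    return 100.0 * time_present_s / time_absent_s


def classify(ratio, threshold=100.0):
    """Label a mouse from its interaction ratio.

    Assumption: scores below the threshold are 'susceptible'; scores at
    or above it are 'resilient'.
    """
    return "susceptible" if ratio < threshold else "resilient"


# Hypothetical example: 30 s in the interaction zone with the target
# present versus 60 s with the target absent.
ratio = interaction_ratio(30.0, 60.0)
print(ratio, classify(ratio))
```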
Sucrose preference. Mice were initially habituated to 50-ml tubes with stoppers 
fitted with ball-point sipper tubes (a two-bottle choice) filled with drinking water 
2 days before the sucrose-preference measurements. After completion of the 
social-interaction test mice were given access to a two-bottle choice of water or 
1% sucrose solution. Bottles containing water and sucrose were weighed at several 
time points during the day at 15:00, 18:00 and 09:00 for 3 days. The position of the 
bottles was interchanged (left to right, right to left) after each weight measurement 
to ensure that the mice did not develop a side preference. Sucrose preference was 
calculated as a percentage (volume of sucrose solution consumed (bottle A) × 100/total 
volume consumed (bottles A + B)). Total sucrose consumption during the first 
12 h after the social-interaction test was measured and used to obtain sucrose 
preference in this study. 
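
As a worked example of the percentage defined above, the following Python sketch computes sucrose preference from the amounts consumed from each bottle. The function name and the example values are hypothetical; consumption is assumed to have already been derived from the bottle weights.

```python
def sucrose_preference(sucrose_consumed, water_consumed):
    """Percentage of total intake taken from the sucrose bottle (bottle A).

    Both arguments must be in the same unit (for example grams or ml).
    """
    total = sucrose_consumed + water_consumed
    if total <= 0:
        raise ValueError("total consumption must be positive")
    return 100.0 * sucrose_consumed / total


# Hypothetical example: 3.0 units of sucrose solution and 1.0 unit of water.
print(sucrose_preference(3.0, 1.0))
```

A value of 50 corresponds to no preference between the two bottles.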

Elevated plus maze. Mice were placed in the centre of an elevated plus maze (arms 
are 33 × 5 cm, with 25-cm tall walls on the closed arms) and their behaviour was 
digitally tracked for 5 min. Mice received VTA optical stimulation during the whole 
duration of the elevated plus maze. A video-tracking system (Ethovision 3.0, 
Noldus) was used to measure the amount of time spent in the closed and open arms. 

Open field. Mice were tested for their activity during 15 min in an open-field arena 
(44 × 44 cm). A video-tracking system (Ethovision 3.0, Noldus) was used to 
measure the locomotor activity of the animal, as well as the time spent in the centre 
(34 × 34 cm) and periphery of the test arena. 

In vitro patch-clamp electrophysiology. Whole-cell optogenetic recordings were 
obtained from VTA dopamine neurons in acute brain slices from TH-Cre mice 
that had been stereotaxically injected with AAV-DIO-ChR2-eYFP into the VTA, 
C57 mice that had been injected with HSV-ChR2-eYFP into the VTA, or C57 mice that 
had been injected with the retrograde viral vector PRV-Cre in either the NAc or mPFC, 
and AAV-DIO-ChR2-eYFP or AAV-DIO-NpHR-eYFP into the VTA. 
Cell-attached recordings were obtained from VTA dopamine neurons in acute brain 
slices from wild-type C57 mice that had been stereotaxically injected with the 
retrograde dye Lumafluor in either the NAc or mPFC. To minimize stress and to obtain 
healthy VTA slices, mice were anaesthetized and perfused immediately for 40-60 s 
with ice-cold aCSF (artificial cerebrospinal fluid), which contained 128 mM NaCl, 
3 mM KCl, 1.25 mM NaH₂PO₄, 10 mM D-glucose, 24 mM NaHCO₃, 2 mM CaCl₂ 
and 2 mM MgCl₂ (oxygenated with 95% O₂ and 5% CO₂, pH 7.4, 295-305 mOsm). 
Acute brain slices containing VTA dopamine neurons were cut using a microslicer 
(DTK-1000, Ted Pella) in cold sucrose aCSF, which was derived by fully replacing 
NaCl with 254 mM sucrose and saturated with 95% O₂ and 5% CO₂. Slices were 
maintained in holding chambers with aCSF for 1 h at 37 °C. Patch pipettes 
(3-5 MΩ) for whole-cell current-clamp, voltage-clamp and cell-attached 
recordings were filled with internal solution containing the following: 115 mM potassium 
gluconate, 20 mM KCl, 1.5 mM MgCl₂, 10 mM phosphocreatine, 10 mM HEPES, 
2 mM magnesium ATP and 0.5 mM GTP (pH 7.2, 285 mOsm). Whole-cell and 
cell-attached recordings were carried out using aCSF at 34 °C (flow rate = 2.5 ml 
min⁻¹). For whole-cell recordings during optogenetic stimulation, resting 
membrane potential and action potentials were recorded in current-clamp mode and 
inward current measurements were made in voltage-clamp mode using the 
Multiclamp 700B amplifier and data acquisition was done in pClamp 10 
(Molecular Devices). Series resistance was monitored during the experiments 
and membrane currents and voltages were filtered at 3 kHz (Bessel filter). For 
ChR2 experiments, sustained and trains (0.1-50 Hz) of blue light were generated 
by a stimulator (mentioned above) and delivered to VTA dopamine neurons 
expressing ChR2 through a 200-µm optic fibre attached to a 473-nm laser. For 
NpHR experiments, different durations of yellow light were generated by the 
stimulator mentioned above and delivered to VTA dopamine neurons expressing NpHR 
through a 200-µm optic fibre attached to a 561-nm laser. For cell-attached action 
potential recordings, signals were band-pass filtered at 300 Hz-1 kHz to identify 
dopamine neurons and were then Bessel-filtered at 10 kHz (gain 50) using a 
Multiclamp 700B amplifier and data acquisition was carried out using pClamp 
10. For measurements of the spontaneous activity of VTA dopamine neurons, 
cell-attached recordings were performed. To measure the intrinsic membrane 
properties of VTA dopamine neurons, whole-cell recordings were carried out in 
current-clamp mode and spikes were induced by incremental increases of current 
injection (each step increase was 50 pA; range 50-200 pA). 

In vivo optrode recording. Viral injection and in vivo optrode recordings were 
carried out as described previously’*’. AAV-DIO-ChR2-eYFP was injected into 
VTA (anterior-posterior, -3.44 mm; lateral-medial, +0.48 mm; dorsal-ventral, 
-4.4 mm) of TH-Cre mice and left for at least 2 weeks to allow for viral expression. 
Prior to recordings, mice were deeply anaesthetized with a ketamine and xylazine 
mixture, placed on a stereotactic frame and attached to a temperature regulator. 


The area of skull directly above the VTA was removed and an optrode consisting 
of a tungsten electrode (1 MΩ resistance; 125 µm length) tightly bundled with 
an optical fibre (200-µm core, 0.2 NA) was lowered to the boundary of the 
VTA (anterior-posterior, -3.44 mm; lateral-medial, +0.48 mm; dorsal-ventral, 
-4 mm). The tip of the electrode protruded about 0.4 mm beyond the optical fibre 
to ensure illumination of the recorded neurons. At the VTA boundary, 
simultaneous optical stimulation and recording of the electrical response of neurons 
expressing ChR2 were carried out as the optrode was gradually lowered in 0.1-mm 
increments until a clear response to optical stimulation was measured. The optical 
fibre was coupled to a 473-nm solid-state laser diode with an approximately 
20-mW output. Recorded signals were band-pass filtered at 300 Hz-5 kHz using 
an 1800 Microelectrode AC amplifier. 

Immunohistochemistry. For double-immunofluorescence experiments, tissue 
sections were fixed in 4% (weight divided by volume; wt/vol) PBS-buffered para- 
formaldehyde. Tissues and cells were blocked in PBS-T (0.3% Triton X-100) 
including 2% (wt/vol) bovine serum albumin (BSA; Sigma) and then exposed 
overnight to these primary antibody mixtures: anti-GFP (Invitrogen, 1:2000) 
and anti-TH (Sigma, 1:10,000). Detection of primary antibodies was carried out 
with mixtures of the secondary antibodies Cy2-anti-mouse and Cy3-conjugated 
anti-rabbit (1:500; Jackson ImmunoResearch). Anti-NeuN (Aves Laboratories, 
NUN) raised from chicken was used at a dilution ratio of 1:1000 and was detected 
by Cy3-anti-chicken. For TH labelling, anti-TH (Sigma) from mouse at a dilution 
ratio of 1:5000 was used and detected with Cy5-anti-mouse. For GFP labelling, 
anti-GFP (Invitrogen) from rabbit was used at a dilution ratio of 1:2000 and was 
detected by Cy2-anti-rabbit. Tissues and cells were counterstained and mounted 
with anti-fade solution, including 4′,6-diamidino-2-phenylindole (DAPI; 
VectaShield, Vector Laboratories). Sections were subsequently imaged (×20 
magnification) on an LSM 710 confocal microscope (Zeiss). Cell counting was carried out manually using 
ImageJ. For immuno-validation of PRV-Cre, animals were killed with a lethal 
dose of isoflurane and then transcardially perfused with PBS followed by 10% formalin 
5-10 days after PRV-Cre injection. After perfusion, brains were kept in 10% 
formalin at 4 °C for 18-24 h, after which the brains were transferred to PBS 
and sliced into coronal sections using a vibrating blade microtome (Leica 
Microsystems, model VT1000S). For immunostaining, we processed the slices 
using antibodies against GFP (Abcam, chicken, 1:1000) and tyrosine hydroxylase 
(Pel-Freez, rabbit, 1:1000). The antibodies were labelled with goat anti-chicken 
Alexa 488 and goat anti-rabbit Alexa 633 (Invitrogen, 1:1000), respectively. 
mCherry expression from the AAV was sufficiently intense for confocal 
microscopy. Sections were imaged at ×20 magnification for analysis, which was 
performed manually. 

Major depression is characterized by diverse debilitating 
symptoms that include hopelessness and anhedonia’. Dopamine 
neurons involved in reward and motivation’’ are among many neural 
populations that have been hypothesized to be relevant"’, and certain 
antidepressant treatments, including medications and brain stimu- 
lation therapies, can influence the complex dopamine system. Until 
now it has not been possible to test this hypothesis directly, even in 
animal models, as existing therapeutic interventions are unable to 
specifically target dopamine neurons. Here we investigated directly 
the causal contributions of defined dopamine neurons to multi- 
dimensional depression-like phenotypes induced by chronic mild 
stress, by integrating behavioural, pharmacological, optogenetic 
and electrophysiological methods in freely moving rodents. We 
found that bidirectional control (inhibition or excitation) of speci- 
fied midbrain dopamine neurons immediately and bidirectionally 
modulates (induces or relieves) multiple independent depression 
symptoms caused by chronic stress. By probing the circuit imple- 
mentation of these effects, we observed that optogenetic recruitment 
of these dopamine neurons potently alters the neural encoding of 
depression-related behaviours in the downstream nucleus accum- 
bens of freely moving rodents, suggesting that processes affecting 
depression symptoms may involve alterations in the neural encoding 
of action in limbic circuitry. 

Major depressive disorder is a debilitating disease characterized by 
diverse symptoms that include depressed mood, suicidality, psycho- 
motor retardation or agitation, reduced motivation or hopelessness, 
and anhedonia’. Current medical therapies for depression take weeks 
to achieve full efficacy, and are ineffective in many patients or cause 
intolerable side effects, pointing to the need for deeper understanding 
of depression states and treatment. Given that there is strong evi- 
dence linking the mesolimbic dopamine system with reward-related””, 
hedonic*® and motivated’ behaviours, we proposed to test whether 
midbrain dopamine neurons could have a causal role in both induction 
and relief of diverse stress-induced depression-related behaviours’®, 
including reduced motivation and anhedonia**”®"". To test this, we 
developed and used an array of tools for real-time assessment and 
control of depression-related states in freely moving rodents. 

We first tested whether selective inhibition of ventral tegmental area 
(VTA) dopamine neurons could acutely induce multiple depression- 
like behaviours. We expressed an enhanced halorhodopsin that hyper- 
polarizes neuronal membranes upon illumination (eNpHR version 
3.0; eNpHR3.0) in VTA tyrosine hydroxylase (TH)-expressing 
dopamine neurons, by injecting a Cre-dependent adeno-associated virus 
(AAV) carrying this eNpHR3.0 into the VTA of TH::Cre mice® (Fig. 1a 
and Supplementary Fig. 1). eNpHR3.0 was fused to the enhanced 
yellow fluorescent protein (eYFP) so that targeting specificity could 
be tracked by quantifying the proportion of VTA neurons expressing 
eNpHR3.0-eYFP (or eYFP alone in age-matched controls) that were 
also TH-positive (Fig. 1b and Supplementary Fig. 2). 

Primary categories of depression assays in rodents involve well- 
validated tests of motivation and anhedonia'** in which performance 
degrades with chronic stress and improves with chronic antidepressant- 
medication treatment’*'*. In the context of depressive phenotypes, 
motivation is assayed by presenting rodents with an inescapable stressor, 
such as those used in the tail-suspension test (TST) or forced-swim test 
(FST), and quantifying the proportion of time spent performing escape- 
related behaviour (struggling) relative to time spent immobile, which has 
been interpreted as a sign of behavioural despair or passivity’. Here we 
compared mice with VTA dopamine neurons expressing eNpHR3.0 
to eYFP-expressing controls, and observed a significant and reversible 
reduction in escape-related behaviour upon illumination (Fig. 1c). We 
investigated whether this reduction in struggling was related to a gross 
locomotor effect, rather than a more selective decrease in motivation to 
escape, by evaluating inhibition of VTA dopamine neurons in these same 
animals during exploration of an open-field-test (OFT) arena. There was 
no significant difference in locomotion between groups (Fig. 1d and 
Supplementary Figs 3 and 4). Although no difference was detected, any 
trend towards subtly decreased locomotion in the eNpHR3.0 group upon 
illumination in the OFT would still be consistent with a depression- 
related phenotype, given that related motor changes are clinically observed 
as psychomotor retardation and/or reduced motivation to explore (Sup- 
plementary Information). 

We next developed a temporally precise version of an assay for 
anhedonia, the sucrose-preference test’*"*. We quantified (by auto- 
mated detection) the number of licks on spouts delivering either water 
or 1% sucrose during a 90-min session consisting of a baseline 30-min 
light-off epoch, followed by a 30-min light-on epoch, and ending with 
another 30-min light-off epoch (Fig. 1e and Supplementary Fig. 5a). 
Notably, we observed a significant reduction in sucrose preference 
during illumination in eNpHR3.0 mice but not eYFP control mice 
(Fig. 1e), and no significant group-by-epoch interaction (two-way 
analysis of variance (ANOVA)) in total licks (Supplementary Fig. 5a). 
Therefore, optogenetic inhibition of VTA dopamine neurons defined 
a phenotype that could be immediately induced and reversed, and 
that was consistent with multiple depression-symptom categories of 
behavioural despair and anhedonia’’. 

To test for a causal link to existing depression-like states, we also 
probed the ability of dopamine-neuron activation to correct indepen- 
dent depression-like phenotypes induced by stress*. As clinical depres- 
sion can be linked to stressors operating over months, we used the 
time-intensive chronic mild stress (CMS) paradigm in which unpre- 
dictable mild stressors are delivered twice daily for 8 to 12 weeks 
to induce a depressive-like state’*. CMS has been shown to 
produce decreases in motivation, assayed by a reduction in escape-related 


¹Picower Institute for Learning and Memory, Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. ²Department of Bioengineering, Stanford 
University, Stanford, California 94305, USA. ³Neurosciences Program, Stanford University, Stanford, California 94305, USA. ⁴Department of Psychiatry and Behavioral Sciences, Stanford University, 
Stanford, California 94305, USA. ⁵Howard Hughes Medical Institute, Stanford University, Stanford, California 94305, USA. ⁶CNC Program, Stanford University, Stanford, California 94305, USA. 
Figure 1 | Selective inhibition of VTA dopamine 
neurons induces a depression-like phenotype. 
a, Cre-dependent AAV. b, Confocal images of 
midbrain dopamine neurons. Orange dotted lines, 
location of fibre optic. Bottom, images of VTA 
neurons directly below fibre track. c, Photoinhibition 
of VTA dopamine neurons acutely reduces 
escape-related behaviour. Two-way ANOVA demonstrates 
the group-by-light epoch interaction (interaction of 
the experimental-group factor and light-condition 
factor in the test), F433 = 3.95, P = 0.00089; 
Bonferroni post-hoc test shows reduced struggling in 
the eNpHR3.0 group relative to the eYFP group, 
*P < 0.05. Error bars, s.e.m. d, Inhibition of VTA 
dopamine neurons does not produce a significant 
difference in open-field locomotion; two-way 
ANOVA did not demonstrate a significant 
group-by-light epoch interaction, F3,43 = 1.76, P = 0.17. 
Error bars, s.e.m. e, Schematic and results of the 
90-min sucrose-preference test. Photoinhibition of VTA 
dopamine neurons acutely reduces sucrose 
preference; two-way ANOVA revealed that opsin 
expression has a significant effect, F1,42 = 6.31, 
P = 0.016; Bonferroni post-hoc test revealed 
significant differences between groups only in the 
light-on epoch, *P < 0.05. Error bars, s.e.m. Off, light 
behaviour in the face of inescapable stressors, as well as anhedonia, as 
measured by reduced sucrose preference’**. 

To phasically activate VTA dopamine neurons, we selectively 
expressed channelrhodopsin-2 (ChR2) in these cells to achieve tem- 
porally precise excitation with light. We included four groups of 
mice (Fig. 2a and Supplementary Fig. 6): mice with ChR2-transduced 
VTA dopamine neurons and exposed to CMS; mice with eYFP- 
transduced VTA dopamine neurons and exposed to CMS; mice with 
ChR2-transduced VTA dopamine neurons and housed in a low-stress 
environment (non-CMS); and mice with eYFP-transduced VTA 
dopamine neurons and non-CMS. 

We examined animals during baseline (light-off), phasic illumina- 
tion (light-on) and post-illumination (light-off) epochs (Fig. 2b) dur- 
ing multiple behavioural assessments. To produce phasic firing in 
VTA dopamine neurons, we used a physiological, sparse bursting 
illumination pattern (Fig. 2b; 8 pulses at 30 Hz, 5-ms pulse width, every
5 s) similar to patterns known to elicit phasic spiking and dopamine
transients during illumination epochs. To assess behavioural motivation
we exposed all four groups of animals to the TST. At baseline,
consistent with previous studies, we observed (Fig. 2c) that CMS
markedly reduced the amount of escape-related behaviour relative to 
non-CMS controls by approximately 50%; moreover, upon illumina- 
tion, the ChR2-CMS group showed a robust increase in escape-related 
behaviour, relative to the eYFP-CMS group (P < 0.001). Thus, phasic 
illumination of VTA dopamine neurons in ChR2 CMS mice, but not 
eYFP CMS mice, reversed the CMS-induced depression-like pheno- 
type back to non-CMS levels, with timing on the order of seconds 
(Fig. 2c). 
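
The timing of the sparse bursting pattern described above (8 pulses at 30 Hz, 5-ms pulse width, one burst every 5 s) can be made concrete with a short sketch. It assumes that the first burst starts at the onset of the light-on epoch, which the text does not state; the function names are illustrative and are not taken from the authors' software.

```python
from fractions import Fraction


def burst_pulse_onsets(epoch_s, pulses_per_burst=8, pulse_hz=30, burst_period_s=5):
    """Return pulse onset times (s, exact Fractions) for one light-on epoch.

    Assumption: the first burst begins at t = 0 of the epoch.
    """
    onsets = []
    burst_start = Fraction(0)
    while burst_start < epoch_s:
        for k in range(pulses_per_burst):
            # Pulses within a burst are spaced 1/pulse_hz seconds apart.
            onsets.append(burst_start + Fraction(k, pulse_hz))
        burst_start += burst_period_s
    return onsets


def duty_cycle(pulses_per_burst=8, pulse_width_s=Fraction(5, 1000), burst_period_s=5):
    """Fraction of each burst period during which the light is on."""
    return pulses_per_burst * pulse_width_s / burst_period_s


onsets = burst_pulse_onsets(epoch_s=180)      # one 3-min light-on epoch
print(len(onsets))                            # total pulses delivered in the epoch
print(float(onsets[7] - onsets[0]))           # first-to-last pulse onset within a burst (s)
print(float(duty_cycle()))                    # fraction of time illuminated
```

Under these assumptions the light is on for less than 1% of each light-on epoch, which is what makes the pattern temporally sparse.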

As dopamine neurons in some systems are linked to locomotion, we
examined locomotion in the OFT chamber (of the same mice included 
in the TST assay) during two 3-min light-on epochs, interleaved with 
two 3-min light-off epochs using the same phasic illumination para- 
meters described above (Fig. 2d). On the timescale corresponding to 
TST effects, only statistically non-significant interactions with altered 
OFT velocity were observed in ChR2 groups upon illumination. 

Next, we tested whether phasic activation of VTA dopamine neu- 
rons could also reverse CMS-induced decreases in sucrose preference 
(Fig. 2e). Consistent with previous studies, baseline measurements 
showed that eYFP and ChR2 CMS mice expressed significantly
lower sucrose preference than eYFP and ChR2 non-CMS mice before 
illumination on the lickometer assay (Bonferroni post-hoc tests,
P < 0.05 and P < 0.01, respectively). However, phasic activation of
VTA dopamine neurons acutely reversed the CMS-induced anhedonic 
effect in ChR2 CMS, but not eYFP CMS, animals (Fig. 2e) without 
altering the total number of licks across groups (Supplementary Fig. 5b). 

Together these results show bidirectional effects of VTA dopamine
neuron activity on rapidly inducing and ameliorating diverse depression- 
related behaviours. We next considered how chronic mild stress itself 
might influence VTA neuronal activity. To investigate this, we recorded 
from the VTA of CMS (n = 93 neurons) or non-CMS (n = 71 neurons) 
animals to probe neuronal firing properties (Supplementary Figs 8-10). 
The mean spike rate was maintained between groups, but out of the 
total spikes recorded, VTA neurons in non-CMS rats had a significantly 
greater proportion of spikes occurring within bursts, had longer-duration 
bursts, and had more spikes in each burst compared to VTA neurons in 
CMS rats (Supplementary Fig. 9), indicating that CMS alters normal 
bursting activity. 
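
To illustrate how the three burst measures compared above (proportion of spikes within bursts, burst duration and spikes per burst) can be derived from a spike train, the sketch below uses a common inter-spike-interval (ISI) criterion for dopamine neurons: a burst begins with an ISI of 80 ms or less and ends when an ISI exceeds 160 ms. This criterion is an assumption made for illustration; the burst definition used for the recordings is given in the Supplementary Information and may differ. The toy spike train is hypothetical.

```python
def detect_bursts(spike_times, onset_isi=0.080, offset_isi=0.160):
    """Group sorted spike times (s) into bursts using an ISI criterion.

    Assumed criterion: a burst starts with two spikes separated by
    <= onset_isi and continues until an ISI exceeds offset_isi.
    """
    bursts = []
    current = []
    for prev, nxt in zip(spike_times, spike_times[1:]):
        isi = nxt - prev
        if not current:
            if isi <= onset_isi:
                current = [prev, nxt]      # burst onset
        elif isi <= offset_isi:
            current.append(nxt)            # burst continues
        else:
            bursts.append(current)         # burst terminated by a long ISI
            current = []
    if current:
        bursts.append(current)
    return bursts


def burst_summary(spike_times):
    """Summarise bursting for one neuron."""
    bursts = detect_bursts(spike_times)
    n_burst_spikes = sum(len(b) for b in bursts)
    n_bursts = len(bursts)
    return {
        "fraction_spikes_in_bursts": n_burst_spikes / len(spike_times) if spike_times else 0.0,
        "mean_spikes_per_burst": n_burst_spikes / n_bursts if n_bursts else 0.0,
        "mean_burst_duration_s": sum(b[-1] - b[0] for b in bursts) / n_bursts if n_bursts else 0.0,
    }


# Hypothetical spike train: a 3-spike burst, two isolated spikes' worth of gaps, a 2-spike burst.
toy = [0.00, 0.05, 0.12, 1.00, 2.00, 2.06, 3.50]
print(burst_summary(toy))
```

Because the mean spike rate can stay the same while these measures change, a summary of this kind separates firing pattern from firing rate.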

As VTA dopamine neurons project to multiple regions throughout 
the brain, it is not clear which downstream targets may be contributing
to the changes in depression-related behaviours, although it is
perhaps relevant that clinical deep brain stimulation in the NAc
(among other targets) has been proposed to modulate depression-related
symptoms. Since the NAc does receive (among many other
inputs) dopaminergic innervation in vivo that can guide certain motivated
behaviours and glutamatergic innervation involved in reward-related
conditioning, we next tested whether glutamate or dopamine
transmission in the NAc could mediate the light-induced rescue of the 
depression-like phenotype, using mice that were implanted with 
bilateral guide cannulae in the NAc in the setting of viral ChR2 transduction
of VTA dopamine neurons and chronic implantation of a
fibre-optic cable targeted at the VTA (Supplementary Figs 11 and 
12), before undergoing 8 to 12 weeks of CMS. 

We carried out within-subject comparisons counter-balanced for 
order, and infused either saline, or glutamate-receptor antagonists, into 
the NAc before the TST. Glutamate-receptor antagonism did not attenuate
the light-induced reversal of the CMS-induced reduction in struggling,
but instead increased overall struggling (Supplementary Fig. 11a). As
the NAc receives glutamatergic innervation from many regions, we 
speculate that the net effect of these inputs may suppress struggling in 
the TST, perhaps consistent with acute clinical anti-depressant effects 
of the much more broadly acting systemic drug ketamine. In contrast,
in a second within-subject comparison, we found that dopamine- 
receptor blockade in the NAc reduced baseline struggling and blocked 
the light-induced reversal of reduced struggling following CMS 
(Supplementary Fig. 11b). These findings are consistent with the hypo- 
thesis that dopaminergic innervation from the VTA to the NAc is 
important for maintaining baseline levels of motivated escape beha- 
viour as well as for elevations in this behaviour elicited by phasic 
activity of VTA dopamine neurons.

Together, these data have shown that selective inhibition of VTA 
dopamine neurons acutely produces depression-related behaviour in 
measures of both motivation and anhedonia, and that phasic activa- 
tion of VTA dopamine neurons acutely rescues a chronic mild stress- 
induced depression-like phenotype, in a phenomenon that requires 

Figure 2 | Temporally sparse phasic photoactivation of VTA dopamine
neurons rescues stress-induced depression-like phenotype. a, Experimental
groups. b, Schematic of illumination pattern; 473-nm light used to elicit phasic
bursts of activity in ChR2-expressing VTA dopamine neurons. c, Phasic
illumination of VTA dopamine neurons rescues stress-induced reduction in
struggling on TST in ChR2 CMS but not eYFP CMS mice (**P < 0.001; eYFP
CMS = 26.8 s ± 3.5 s; ChR2 CMS = 28.0 s ± 2.7 s; ChR2 non-CMS = 55.63 s
± 4.45; eYFP non-CMS = 56.0 s ± 4.7 s). Two-way ANOVA revealed a significant
group-by-light epoch interaction (F6,134 = 6.04, P < 0.0001) and a significant
influence of experimental condition on performance as revealed by one-way
ANOVA (F3,67 = 6.20, P = 0.0009; Bonferroni post-hoc test). Error bars, s.e.m.
d, Illumination parameters, as used in the TST, did not change locomotor activity in
the open field, with no significant group-by-epoch interaction in two-way ANOVA
(F9,152 = 0.99, P = 0.4493), and no detectable differences revealed by Bonferroni
post-hoc tests on the same timescale. Error bars, s.e.m. e, Phasic activation of VTA
dopamine neurons rescued the stress-induced decrease in sucrose preference in
ChR2 CMS, but not eYFP CMS mice; one-way ANOVA, Dunn’s post-hoc test
comparing baseline to light-on epoch, P < 0.01 for ChR2 CMS mice, P = 0.2851
for eYFP CMS mice. Two-way ANOVA revealed a significant group-by-light
epoch interaction (F6,62 = 4.33, P = 0.001), and a significant influence of
experimental condition on performance was revealed by one-way ANOVA
(F3,31 = 3.40, P = 0.0299); **P < 0.01 for ChR2 CMS mice. Error bars, s.e.m.

functioning of dopamine receptors in the NAc. Next, to investigate
high-temporal resolution neural representations of depression-related 
action in downstream neurons under baseline and phasic activation 
conditions of VTA dopamine neurons, we combined several new tools. 
Although the sucrose-preference task is not suited for high-temporal 
resolution assessment of neural representations of behaviour, as the 
outcome is manifested slowly and stochastically over many tens of 
minutes, we considered that escape-related behaviour might be better 
suited for simultaneous circuit-dynamics measurement. However, 
FST quantification traditionally involves low temporal-resolution 
annotation of immobility epochs. We required a new method for 
millisecond-temporal-resolution detection of escape-related beha- 
viour for integration with optogenetics and in vivo electrophysiology. 

Figure 3 | Phasic activation of VTA dopamine neurons modulates escape-
related behaviour in TH::Cre rats. a, In vivo electrophysiology sessions.
b, Integration of recordings in the NAc, illumination of ChR2-expressing VTA
dopamine neurons, and precision measurement of swimming behaviour in five
TH::Cre rats treated with CMS. c, Light-induced modulation of swimming;
phasic illumination of ChR2-expressing VTA dopamine neurons increases
escape behaviour of TH::Cre rats in FST. Thick red line, mean kick rate; dotted
lines, s.e.m. d, Phasic illumination of ChR2-expressing VTA dopamine neurons
increases kick rate in FST (paired t-test; **P = 0.0088; mean increase in kick
rate of 73.9% ± 21.2% relative to light-off epochs) but not ambulation rate in
OFT (mean decrease of 60.2% ± 22.9%; mean ± s.e.m.). e, Peri-event raster
histogram showing kick events referenced to the train of 8 light pulses at 30 Hz
every 5 s (blue); kick events are not time-locked to light pulses (bin width of
0.175 s for all histograms). f, Scatter plot relating the degree to which behaviour
of each rat was modulated by light-driven VTA dopamine neuron activation to
the proportion of all light-responsive units recorded from the same subject
(Pearson’s correlation test, P = 0.0271, r = 0.9195).

Because even a lightweight electrophysiological recording headstage
impeded normal escape-related behaviours in mice, this experiment
required TH::Cre rats, which were able to express escape-related behaviours
while bearing an electrophysiological recording headstage in the
FST without mechanical noise (Fig. 3a–c). We used a novel high-speed
method of magnetic induction-based assessment of depression-related 
behaviour, placing a magnetic coil surrounding the forced-swim tank 
and a small magnet attached to the hind-paw to measure swimming 
kicks or escape-related behaviour. We validated that illumination of 
ChR2-expressing VTA dopamine neurons in TH::Cre rats rapidly and 
reversibly increased escape-related behaviour (Fig. 3 and Supplemen- 
tary Fig. 13) but did not alter locomotion in the OFT (Fig. 3d and 
Supplementary Fig. 14). 

We carried out in vivo electrophysiological recordings in the NAc of 
TH::Cre rats during baseline activity in the home cage, during
exploration in the OFT, and during the FST (Fig. 3a and Supplemen- 
tary Fig. 15) while illuminating ChR2-expressing VTA dopamine neu- 
rons following CMS (Fig. 3b, c). We observed a robust increase in kick 
frequency during light-on epochs relative to light-off epochs, but did 
not observe time-locked kicking to light pulses on the order of seconds 
(Fig. 3c-e), suggesting that this behaviour may be modulated by dopa- 
minergic tone rather than individual dopamine transients. We also 
found that the more NAc neurons showing phasic responses to VTA 
dopamine activation in an animal, the greater the relative increase in 
escape-related behaviour, but not OFT locomotion (Supplementary 
Fig. 16 and Fig. 3f). 
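
As a worked check of the correlation reported in Fig. 3f (r = 0.9195, P = 0.0271, five rats), the sketch below converts Pearson's r into a two-tailed P value. It assumes the standard t-test on r with n − 2 = 3 degrees of freedom, which the caption does not spell out, and uses the closed-form cumulative distribution of Student's t with 3 degrees of freedom so that no statistics library is needed. The function name is illustrative.

```python
import math


def pearson_p_value_n5(r):
    """Two-tailed P value for Pearson's r with n = 5 observations (df = 3).

    Uses t = |r| * sqrt(df / (1 - r^2)) and the closed-form CDF of
    Student's t distribution for 3 degrees of freedom:
        F(t) = 1/2 + (1/pi) * [x / (1 + x^2) + atan(x)],  x = t / sqrt(3)
    """
    df = 3
    t = abs(r) * math.sqrt(df / (1.0 - r * r))
    x = t / math.sqrt(df)
    cdf = 0.5 + (x / (1.0 + x * x) + math.atan(x)) / math.pi
    return 2.0 * (1.0 - cdf)


p = pearson_p_value_n5(0.9195)
print(round(p, 4))   # compare with the P value reported in Fig. 3f
```

With only five animals, a correlation coefficient above 0.9 is needed to reach this level of significance, so the relationship should be read as suggestive of a per-animal link rather than as a precise estimate.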

We examined the proportion of NAc neurons (n = 123) showing
phasic responses to light pulses in VTA or phasic responses associated 
with FST kicks (Supplementary Figs 17 and 18). Many NAc neurons 
(45 out of 123 (37%); Fig. 4a) showed phasic responses to VTA light 
(see Supplementary Materials for statistical methods). Of these 45 neu- 
rons, 6 showed phasic inhibition to light, whereas the dominant popu- 
lation (39 out of 45) showed phasic excitation. Separate from phasic 
responses, we also observed that upon FST exposure, more NAc neu- 
rons showed decreases than increases in mean epoch firing rate com- 
pared with home-cage firing (Supplementary Fig. 19). Furthermore, 49 
out of 123 NAc neurons (40%; Fig. 4a) encoded escape-related beha- 
viour, as seen by phasic responses associated with kick events; 14 of 
these 49 (29%) showed phasic inhibition with kick, and the dominant 
population (35 out of 49; 71%) showed phasic excitation with kick. Of 
the 123 neurons, 24 encoded both light pulses and kick events (Fig. 4a 
and Supplementary Fig. 18; all summary data can be found in Sup- 
plementary Fig. 19). 
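
The counts in this paragraph can be tallied in a few lines. The percentages reproduce the rounded values quoted in the text; the light-only, kick-only, either and neither counts are not stated in the text and are derived here by inclusion-exclusion from the reported numbers.

```python
total = 123
light_responsive = 45   # phasic response to VTA light pulses
light_inhibited = 6
light_excited = 39
kick_responsive = 49    # phasic response associated with kick events
kick_inhibited = 14
kick_excited = 35
both = 24               # encoded both light pulses and kick events


def pct(part, whole):
    """Percentage rounded to the nearest integer, as quoted in the text."""
    return round(100 * part / whole)


# Derived quantities (not reported directly in the text).
light_only = light_responsive - both
kick_only = kick_responsive - both
either = light_responsive + kick_responsive - both   # inclusion-exclusion
neither = total - either

print(pct(light_responsive, total), pct(kick_responsive, total))
print(pct(kick_inhibited, kick_responsive), pct(kick_excited, kick_responsive))
print(light_only, kick_only, either, neither)
```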

As we found robust phasic electrophysiological responses to escape-
related behaviours in the NAc, and observed that VTA dopamine 
action was causally involved in depression-related behaviour (Figs 1 
and 2) and that dopamine signalling in the NAc was required for 
mediating escape-related behaviours (Supplementary Fig. 11b), we 
examined relevant circuit responses by determining whether VTA 
dopamine neurons modulated the neural representation of escape 
behaviour in the NAc. We began by distinguishing kick events occur- 
ring during light-on (VTA dopamine neuron activation) epochs from 
those occurring during light-off epochs. Of the 123 neurons, 14 (11%) 
showed phasic responses to kicks that occurred during both light-on 
and light-off epochs; 13 of these 14 cells similarly represented kick in 
both epochs (Fig. 4b; Example cell 6), and only 1 of these 14 cells dif- 
ferentially represented kick in the light-on epoch relative to the light-off 
epoch (Fig. 4b; Example cell 5). Next, we found that light activation of 
VTA dopamine neurons powerfully altered the encoding of kick events 
in two substantial populations of NAc neurons (Fig. 4b). Of the 
123 neurons, 13 (11%) selectively encoded escape-related behaviour 
but only during the light-on (VTA dopamine neuron activation) epoch, 
and 16 out of 123 neurons (13%) selectively encoded escape-related 
behaviour but only during the light-off (baseline) epoch (Fig. 4b). 
Optogenetic recruitment of VTA dopamine neurons, which gives rise 
to antidepressed-like escape behaviour, therefore appeared to recruit a 
novel representation of the escape action in the NAc of freely moving 
rodents (Fig. 4). 

Here, we have shown that selective inhibition of VTA dopamine
neurons acutely induces multiple distinct depression-like behaviours, 
that chronic mild stress induces a lasting depression-like phenotype 
with multiple symptoms reversed by phasic activation of VTA dopa- 
mine neurons, that NAc dopamine (but not glutamate) receptors are 
required for the action of these neurons in depression-related escape 
behaviour, and that NAc neurons encode phasic activation of VTA 
dopamine neurons as well as depression-related escape behaviour. 
Furthermore, we have shown that the NAc representation of this 
depression-related behavioural action is in fact fundamentally altered 
by VTA dopamine neuron activation. These findings may together 
provide circuit-level insight into the causal dynamics of depression- 
related behaviour. 

Certain earlier VTA recordings in depression models have shown 
reduced bursting that was restored after antidepressant treatment,
consistent with our results and the finding that attenuation of the mesocortical
dopaminergic pathway may be linked to social-defeat susceptibility.
Conversely, other studies have instead shown increased bursting


Figure 4 | Phasic activation of VTA dopamine 
neurons modulates NAc encoding of escape- 
related behaviour. Recording of 123 NAc 
neurons from 5 CMS TH::Cre rats expressing 
ChR2 in VTA dopamine neurons. a, Peri-event 
raster histograms for representative neurons 
showing phasic excitation associated with both 
light pulses and kick events (Example cell 1; 15
24 cells). b, Peri-event raster histograms for
representative neurons that selectively encoded 
kick events during either the light-on epoch 
(Example cell 3; 13 cells) or the light-off epoch 
(Example cell 4; 16 cells), or encoded kick events 
in both light-on and light-off epochs differentially 
(Example cell 5; 1 out of 14 cells) or similarly 
(Example cell 6; 13 out of 14 cells). 

in mice susceptible to a 10-day social defeat. Differences might
stem from the models used, as chronic mild stressors could reduce
dopamine neuron firing, whereas more severe stressors analogous to
those employed in social defeat models could increase dopamine neuron
firing. Our results on depression-related behaviours are consistent
with models of dopamine-mediated responses in reward-related
behaviour, with VTA dopamine firing underlying anticipation or
receipt of reward, with VTA GABA-neuron firing underlying aversion
or anhedonia, with chemical and electrical VTA-stimulation
effects on feeding, and with antidepressant effects achieved by deep brain
stimulation in the NAc. It is important to emphasize in this context
that the effects of stress on the mesolimbic dopamine system are highly
complex, as different stressors can cause opposite responses from VTA
neurons depending on pre-exposure and severity.

Depression and its treatment are likewise exceedingly complex; 
for example, serotonergic pathways and agents exert antidepressant 
effects, and antidepressant effects of deep brain stimulation are seen in 
the subgenual cingulate cortex distinct from the classical mesolimbic 
dopamine system. These results underscore the fact that psychiatric
diseases defined by a constellation of different classes of symptoms can 
be influenced by multiple neural circuit processes. The heterogeneity 
of mood disorders further complicates the pinpointing of precise circuit
dysfunctions mediating symptoms of depression. In animal
models, tests such as sucrose consumption (when depressed patients 
can experience either increased or decreased appetite) and the FST 
(which may involve transitions between active and passive coping 
strategies) must be interpreted with caution. However, studying tar- 
geted circuits that are well-conserved between rodents and humans, 
and that (as we have found) specifically cause, correct and encode 
diverse symptom classes of depression, will likely continue to 
advance our understanding of the biological underpinnings of depres- 
sion and related behaviours. 


METHODS SUMMARY 


Cre-dependent AAVs were packaged as AAV5; maps and clones are available at
http://www.optogenetics.org. The VTA in TH::Cre-driver rats or mice was virally 
transduced and an optical fibre surgically implanted. To confirm opsin expression 
and fibre targeting, coronal brain slices were prepared for immunohistochemistry and 
optical microscopy; stained brain sections were imaged with confocal microscopy. 

Behavioural and electrophysiological data were analysed in Matlab and Neuro- 
explorer with custom-written software. For rat tetrode recordings under FST, 
before the start of recordings rats were anaesthetized for 10 min to facilitate water- 
proofing of the headstage and electrodes, and were subsequently allowed to 
recover for at least 1h. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS

Subjects. TH::IRES-Cre homozygous male mice (Figs 1 and 2) obtained from the 
European Mouse Mutant Archive (mouse line EM:00254) were bred with wild-type
C57BL/6 females to produce subjects. Only heterozygous TH::IRES-Cre male
offspring that had been back-crossed for at least five generations were used for 
experiments. Animals were age matched, underwent surgery and were divided 
into two groups: the chronic mild stress (CMS) group and the control non-CMS 
group. Non-CMS animals were housed in a quiet room with a reverse 12 h light-
dark cycle and given food and water ad libitum. CMS animals were housed separately
and were subjected to standard CMS protocols: two stressors per day for 8 to
12 weeks before behavioural testing. Mice in this group experienced one stressor
during the day and a different stressor during the night. Well-validated and
approved standard stressors were randomly chosen from the following list so as
to be unpredictable for subjects: cage tilt at a 45° angle for 1 to 16 h; food deprivation
for 12 to 16 h; white noise (http://www.simplynoise.com) for 1 to 16 h; strobe
light illumination for 1 to 16 h; crowded housing (4 to 5 mice in a 10 cm × 13 cm
× 13 cm plastic box with air holes) for 1 to 3 h; individual housing (separating
cage-mates into single-housing cages) for 1 to 16 h; light cycle (continuous illumination)
for 24 to 36 h; dark cycle (continuous darkness) for 24 to 36 h; water
deprivation for 12 to 16 h; damp bedding (200 ml water poured into sawdust
bedding) for 12 to 16 h.

When not undergoing crowded housing, food deprivation or water deprivation 
stressors, water and food were available ad libitum. Experimental protocols were 
approved by Stanford University Institutional Animal Care and Use Committee 
(IACUC) and meet guidelines of the National Institutes of Health guide for the 
Care and Use of Laboratory Animals. 

TH::Cre BAC transgenic rats (Fig. 4) were bred by mating Cre-positive founders 
to wild-type rats to produce TH::Cre heterozygous transgenic rats on a Long- 
Evans background. Only male rats aged 5 to 7 months were used for experiments. 
All rats underwent a 5- to 7-week chronic mild stress paradigm before behavioural 
testing and electrophysiological recording. Animals were subjected to the same 
twice-daily stressors on the same time course, except crowded housing and indi- 
vidual housing were both replaced with a standard restraint tube stressor in which 
the animal is immobilized in a cylindrical plastic tube (Braintree Scientific) for 
30 to 45 min. All rats were singly housed with a 12 h standard light-dark cycle and 
when not undergoing water deprivation, food deprivation or restraint stress, water 
and food were available ad libitum. Experimental protocols were approved by 
Stanford University IACUC and meet guidelines of the NIH guide for the Care 
and Use of Laboratory Animals. 

Virus preparation. Virus preparation was as described previously with ChR2-eYFP
(mice), ChR2(H134R)-eYFP (rats only), eNpHR3.0-eYFP (mice) or the
eYFP alone (all) inserted between incompatible loxP and lox2722 sites in reverse
orientation. In all cases, the cassette was cloned into a pAAV2-MCS vector carrying
the EF-1α promoter, and the woodchuck hepatitis virus post-transcriptional
regulatory element (WPRE) to enhance expression. All Cre-inducible recombi- 
nant AAV vectors were packaged with AAV5 coat proteins at the University of 
North Carolina Vector Core. The final viral concentration was 3 X 10’ particles 
per ml for ChR2 virus and 1.5 10" particles per ml for the eNpHR3.0 virus. 

Stereotactic injection into VTA, and cannula and fibre implantation. Mice in 
Figs 1 and 2 were aged 2 to 4 months. These mice were deeply anaesthetized with 
isoflurane and placed into a stereotactic apparatus. The head was levelled using
Bregma and Lambda reference points and a craniotomy was performed. Virus was
injected at 2 sites in the VTA of the right hemisphere using a 33-gauge metal needle
and a 10-µl syringe controlled by an injection pump at 0.1 µl min⁻¹, for 2 µl in total (1 µl at
anterior–posterior, −3.3; lateral–medial, 0.5; dorsal–ventral, −4.1; and 1 µl at
dorsal–ventral, −4.6). The needle was left in place after injection for 10 min before
slowly being withdrawn. For all mice, an implantable fibre-optic light guide (IFL),
consisting of a metal ferrule, 2.5 mm in diameter with a 200-µm thick, 5-mm long
cleaved bare optic fibre (Doric Lenses) was implanted at anterior–posterior, −3.3;
lateral–medial, +0.5; and dorsal–ventral, −3.9. In addition to this implant, mice
used for pharmacological manipulations were also implanted with a bilateral guide
cannula with a 1 mm centre-to-centre distance for guides at coordinates (anterior–
posterior, +1.2; lateral–medial, +0.5; lateral–medial, −0.5; dorsal–ventral, −3.25),
0.5 mm above the NAc. Instant adhesive was placed between the base of the cannula
and the skull, and skull fixture adhesive was used to cement the cannula to the skull.

Blue and yellow light delivery and protocols. A 200-µm patch cord was connected
to the external portion of the chronically implantable optical fibre with a
zirconia sleeve. Optic fibres were attached through an FC/PC adaptor to a 473-nm 
blue laser diode (no. BCL-473-050-M), and light pulses were generated through a 
stimulator (no. 33220A). For rats and mice expressing ChR2 and their eYFP 
controls, the light paradigm was 8 light pulses at 30 Hz every 5 s. Light-on epochs 
were 3 min for all assays (TST, OFT and FST) other than for anhedonia, for which 
the light epoch was 30 min. Optical-fibre light power from the patch cord was
measured using a light sensor (S130A) and intensity calculated using a model
based on empirical measurements from mammalian brain tissue for predicting
irradiance values (http://www.stanford.edu/group/dlab/cgi-bin/graph/chart.php).
For ChR2-transduced mice and controls, estimated light intensity at 0.5 mm from
fibre tip ranged from 22.7 to 26.2 mW mm⁻². For yellow light stimulation in
eNpHR3.0 animals and eYFP controls, a 593-nm yellow laser was used; light
intensity was calculated to be from 1.9 to 4.9 mW mm⁻², and illumination was
constant in light-on epochs. For rats, 300-µm patch cords and FC/PC cables were
used with 0.37 NA fibre, and light intensity ranged from 17.0 to 23.8 mW mm⁻².

Pharmacological infusion of glutamate and dopamine receptor antagonists.
For pharmacological experiments, drug or saline was infused in a volume of 0.4 µl
through a 26-gauge stainless steel double internal cannula (PlasticsOne) that was
0.5 mm longer than the guide cannula. Each graph in Fig. 3 depicts a within-
subjects matched comparison, counter-balanced for treatment order. The internal
cannula was connected to a microsyringe pump by a PE20 tube. Solutions were
administered at a constant rate of 100 nl min⁻¹, and the injection cannula was
removed 2 min after the termination of the injection; TST was performed 10 min
post infusion. For dopamine receptor antagonist experiments, 800 ng of SCH23390
per 0.4 µl per side at a concentration of 6.16 mM SCH23390 was infused for antagonism
of D1-like receptors and 400 ng per 0.4 µl per side at a concentration of
2.89 mM raclopride was infused for D2-like receptor antagonism. For glutamate
receptor antagonism, 3 µg per 0.4 µl per side, at a concentration of 22.3 mM for
NBQX and 3 µg per 0.4 µl per side at a concentration of 38.04 mM for AP5 was used
to antagonize AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid)
and NMDA (N-methyl-D-aspartate) receptors, respectively. Each animal was
experimentally tested using the TST, once post drug infusion and once post saline 
infusion, with order counterbalanced; the two tests were carried out in different 
contexts on different days. 
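
The doses above are given both as a mass per 0.4-µl infusion and as a molar concentration, so the two can be cross-checked. The sketch below computes the molar mass implied by each pair of values, together with the infusion duration at 100 nl per min. It uses only numbers stated in this section; the helper names are illustrative, and the implied molar masses can be compared with supplier data sheets, which are not reproduced here because salt forms are not specified in the text.

```python
def implied_molar_mass(mass_ng, volume_ul, conc_mM):
    """Molar mass (g/mol) implied by mass_ng dissolved in volume_ul at conc_mM."""
    grams = mass_ng * 1e-9
    moles = (volume_ul * 1e-6) * (conc_mM * 1e-3)   # litres x mol per litre
    return grams / moles


def infusion_minutes(volume_ul, rate_nl_per_min):
    """Time needed to deliver volume_ul at a constant rate."""
    return volume_ul * 1000.0 / rate_nl_per_min


# (mass in ng, volume in ul, concentration in mM) per side, as stated in the text.
doses = {
    "SCH23390": (800, 0.4, 6.16),
    "raclopride": (400, 0.4, 2.89),
    "NBQX": (3000, 0.4, 22.3),
    "AP5": (3000, 0.4, 38.04),
}

for name, (ng, ul, mM) in doses.items():
    print(name, round(implied_molar_mass(ng, ul, mM), 1), "g/mol")

print(infusion_minutes(0.4, 100), "min per infusion")
```

At 100 nl per min, each 0.4-µl infusion takes about 4 min, followed by the 2-min wait before the cannula is removed.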

Immunohistochemistry. To determine the specificity of eNpHR3.0-eYFP expression
in dopamine neurons, TH::IRES-Cre mice transduced with the double-floxed
AAV encoding eNpHR3.0-eYFP were anaesthetized and transcardially perfused 
with PBS followed by 4% paraformaldehyde (PFA) dissolved in PBS. Brains were 
removed and post-fixed in PBS containing 4% PFA overnight at 4°C, and subse- 
quently immersed in a cryoprotectant consisting of PBS containing 30% sucrose 
until settling (~48 h at 4°C). Coronal brain sections (40 µm) were collected and
washed in PBS; blocking solution, primary and secondary solutions contained 0.3% 
Triton X-100 (PBST) and 3% normal donkey serum dissolved in PBS. Localization 
of dopamine cell bodies and fibres was confirmed by labelling with chicken anti- 
tyrosine hydroxylase antibody (1:300). Cell bodies were identified using the 4’,6- 
diamidino-2-phenylindole (DAPI) stain (1:50,000). Transduction efficiency was 
quantified using a confocal microscope by comparing the eYFP cells with TH- 
immunoreactive cells. 

Behavioural testing. For animals undergoing CMS, behavioural testing occurred 
after 8 to 12 weeks of CMS for mice (or 8 to 12 weeks of no CMS for non-CMS 
controls) or 4 to 6 weeks of CMS for rats. For NpHR animals and eYFP controls, 
behavioural testing occurred at least 1 month post surgery. All behaviour was 
conducted during the dark cycle (07:00 to 19:00). 

Open field test. The open-field test was conducted in an open plastic chamber 
(50 cm long, 50 cm wide and 40 cm deep). Mice were plugged into a patch cord
connecting to the external portion of the chronically implanted optical fibre, indi- 
vidually placed in the centre of the chamber and allowed to freely explore for 12 min. 
Velocity of the animal in the field was measured using an automated video-tracking 
system (Viewer II, BiObserve). Measurement began immediately after placement 
in the chamber. Light stimulation occurred for minutes 3 to 6 and 9 to 12 only.
Although no difference was detected, any trend towards subtly decreased loco- 
motion in the eNpHR3.0 group after illumination in the OFT would still be con- 
sistent with a depression-related phenotype, as related motor changes are clinically 
observed as psychomotor retardation and/or reduced motivation to explore.

Tail-suspension test. Mice were plugged into a patch cord and the tail was placed
between two strips of autoclave labelling tape. The end of one strip of tape was then 
secured to a horizontal bar 40 cm from the ground, ensuring that the animal could 
not make other contact or climb during the assay. Video recording was started 90 s 
from the time that the animal was inverted and taped; light stimulation was 
between minutes 3 to 6 of the assay. Time spent struggling was measured by blind 
scoring each minute of video material after the testing was completed, and was 
reported in seconds for each minute of the assay. 
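
The light schedules for the open-field and tail-suspension tests can be written as a small lookup, which is convenient when labelling per-minute scores. The sketch below treats "minutes 3 to 6" as the half-open interval from 3 min to 6 min after assay start and assumes a 9-min tail-suspension test (three 3-min epochs); both are assumptions, and the names are illustrative.

```python
# Light-on windows in minutes from assay start, as half-open intervals [start, end).
LIGHT_ON_WINDOWS = {
    "OFT": [(3, 6), (9, 12)],   # 12-min open-field test
    "TST": [(3, 6)],            # tail-suspension test (assumed 9 min in total)
}


def light_is_on(assay, minute):
    """True if the minute starting at `minute` falls in a light-on window."""
    return any(start <= minute < end for start, end in LIGHT_ON_WINDOWS[assay])


def epoch_labels(assay, total_minutes):
    """Label each scored minute of the assay as 'on' or 'off'."""
    return ["on" if light_is_on(assay, m) else "off" for m in range(total_minutes)]


print(epoch_labels("OFT", 12))
print(epoch_labels("TST", 9))
```

Labelling each scored minute in this way gives the baseline, light-on and post-illumination epochs used in the group-by-light-epoch comparisons.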

Sucrose-preference test. TH::Cre males transduced with ChR2-eYFP or 
eNpHR3.0-eYFP were tested alternately with their respective eYFP-only controls. 
Animals were water-restricted overnight before exposure to the lickometer. A Med 
Associates operant chamber was used to count every tongue contact made (‘licks’) 
with either the 1% sucrose in water solution or the water alone. Bottle side and 
animal group tested were counterbalanced among chambers. In the 90-min test, 
the first 30 min of the assay were used to collect a baseline sucrose preference. Over 
the next 30 min of the session (minutes 30-60), light delivery occurred through a
jacketed patch cord, with blue light for ChR2 animals and eYFP controls (CMS 
and non-CMS groups) at 8 pulses at 30 Hz every 5 s, or constant yellow light for the 
eNpHR3.0 animals and their eYFP controls at the same light intensities used for 
TST assays. The last 30 min of the test (minutes 60-90) was again a no-light, post-light-delivery
measure of sucrose preference. To ensure that all animals had consumed liquid
from both spouts, identified the location of the sucrose and water spouts (counter- 
balanced across animals), and familiarized themselves with the placement of the 
bottles, light delivery between 30 and 60 min only occurred if both spouts had been 
used in the first 30-min baseline epoch. If the animal failed to lick at either spout 
during the first epoch (indicating that the mouse did not discover the spout and 
identify the solution within), the test was aborted at 30 min, the animal did not 
receive any light and the test was repeated on a different day. 
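
The three 30-min epochs and the baseline gating rule described above can be sketched in a few lines. This is an illustrative sketch only: the text does not give the formula used for sucrose preference, so the ratio of sucrose licks to total licks is an assumption, as are the function names and the representation of licks as timestamps in minutes.

```python
# Hypothetical helper names; lick times are assumed to be minutes from test start.
EPOCHS = {"baseline": (0, 30), "light": (30, 60), "post": (60, 90)}


def licks_in(licks, start, end):
    """Count licks with start <= t < end."""
    return sum(1 for t in licks if start <= t < end)


def sucrose_preference(sucrose_licks, water_licks):
    """Per-epoch preference, ASSUMED to be sucrose licks / total licks."""
    result = {}
    for name, (start, end) in EPOCHS.items():
        s = licks_in(sucrose_licks, start, end)
        w = licks_in(water_licks, start, end)
        result[name] = s / (s + w) if (s + w) else None
    return result


def light_allowed(sucrose_licks, water_licks):
    """Light is delivered only if both spouts were used in the baseline epoch."""
    start, end = EPOCHS["baseline"]
    return (licks_in(sucrose_licks, start, end) > 0
            and licks_in(water_licks, start, end) > 0)
```

A session in which `light_allowed` is false corresponds to the case in which the test was aborted at 30 min and repeated on a different day.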

In vivo NAc electrophysiology. TH::Cre rats received AAV5-EF1α-DIO-ChR2-eYFP
to enable expression before CMS. Two small craniotomies were drilled unilaterally
over the VTA at the following coordinates: anterior-posterior, −5.4 and
−6.2; lateral-medial, +0.7. A bevelled 33-gauge needle was used to deliver 1.0 μl
of virus at 2 depths in each hole (dorsal-ventral, −8.2 and −7.0; all coordinates
from skull surface) for a total of 4.0 μl per animal. Each 1.0 μl of virus was infused
at a speed of 100 nl min−1 using a syringe pump; the virus infuser was left in place
for an additional 10 min after each injection before being removed slowly. After
the CMS protocol described above, these animals had a 24-wire fixed electrode
array implanted into the medial shell of the NAc (anterior-posterior, 1.6; lateral-medial,
1.3; dorsal-ventral, −7.5). A third burr hole was drilled (anterior-posterior,
−5.8; lateral-medial, +0.7; dorsal-ventral, −7.5) for the insertion of
an implantable optical fibre targeted just dorsal to the VTA. Ground wires were
implanted at a depth of approximately 2 mm lateral to the optical fibre. After
behavioural and recording experiments, electrolytic lesions were created just 
before sacrifice to allow histological identification of electrode-tip location. 
Forced swim test. On day 1, a 15-min pre-test swim was conducted for each 
animal, and on day 2 rats were plugged into a patch cord and a HS-27 pre-amplifier 
head stage with 24 electrode channels and 3 reference channels. The patch cord
and HS-27 tether were waterproofed with a latex tube before being connected.
Immediately after a 5-min recording in the home cage and a 12-min OFT, the rat 
was placed into a cylindrical tank of water (25 to 26 °C), in which the animal could 
not touch the bottom of the tank. Minutes 6 to 51 were used for analysis of neural 
data. Light stimulation occurred during minutes 12 to 15, 21 to 24, 30 to 33, 39 to
42 and 48 to 51, to create an alternating 6 min off, 3 min on light-stimulation
paradigm. Magnetic induction was used to provide a high temporal-resolution 
readout; swimming activity (referred to as ‘kicks’ or ‘kick events’) was timed and 
quantified with a coil and a hind-paw magnet to give a continuous analogue signal. 
This continuous signal was then thresholded at 3 standard deviations from the 
mean for all rats through both light-on and light-off epochs in the FST. 
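
The thresholding step can be illustrated with a short sketch. It assumes the analogue signal is available as a list of samples, that a kick event is an upward crossing of the threshold, and that the mean and standard deviation are taken over whatever samples are passed in; the function name and these choices are assumptions, not details given in the text.

```python
import statistics


def kick_events(signal, n_sd=3.0):
    """Return sample indices at which the signal rises above mean + n_sd * SD.

    Assumption: only upward crossings count, and the population standard
    deviation of the supplied samples is used.
    """
    mean = statistics.fmean(signal)
    sd = statistics.pstdev(signal)
    threshold = mean + n_sd * sd
    above = [x > threshold for x in signal]
    # A crossing is a sample above threshold whose predecessor was not.
    return [i for i, flag in enumerate(above)
            if flag and (i == 0 or not above[i - 1])]
```

Consecutive supra-threshold samples are reported once, at the first sample of the run, so a sustained deflection counts as a single event.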
Analysis of in vivo awake-behaving NAc electrophysiological recordings. To 
test the relevance of phasic-firing-rate changes for individual neurons in relation 
to either light-pulse onsets or kick-signal threshold crossings (Fig. 4), two time 
windows relative to the event time were used. For the statistical analyses of neural 
responses to the first light pulse in each 8-pulse train and kick signals (all trials 
were included unless otherwise stated), a baseline window was compared to a 
response window for each reference event. For the reference event referred to as 
‘light’, the event was the onset of the first light pulse in each 8-pulse train occurring 
every 5 s; we selected a baseline window from t = −3 s to −2 s, and selected the
response window that initiated after first light-pulse onset and spanned from
t = 0 s to 1 s. For kick events (animal-initiated rather than experimenter-initiated),
we used a baseline window further from the event onset given the variable inter- 
event interval latency; —4 s to —2 s was used as a measure of baseline firing rate for 
all kick events, and a response window from —0.5 s to 0.5 s was used as a measure 
of the response. Bin size for rate calculations was 200 ms. For all phasic responses 
discussed in this study, the fractional change in the average firing rate between the 
baseline and response window was computed across all event occurrences (the
number of events ranged from 1,971 to 4,718), and the significance
of this change was determined by its percentile within a non-parametric
bootstrap distribution estimated using 500 random circular permutations of spike
times relative to event times, in which P values of less than 0.05 were considered
significant responses. For all classes of reference events, the number of significantly
modulated neurons statistically exceeded the expected number of false positives
given multiple comparisons (95% confidence interval of 2 to 11 false positives
occurring, binomial test, n = 123, α = 0.05). Mean firing-rate changes are indicated
in Supplementary Fig. 17 and described in the main text.
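
The window comparison and permutation procedure described above can be sketched as follows. This is a minimal illustration rather than the analysis code used in the study: the function names, the uniform random shift, the two-sided comparison and the add-one correction to the P value are all assumptions, and the 200-ms binning is omitted because the rates are computed directly from spike counts in each window.

```python
import bisect
import random


def count_in_windows(sorted_spikes, events, window):
    """Total spikes falling in [event + lo, event + hi) over all events."""
    lo, hi = window
    return sum(bisect.bisect_left(sorted_spikes, e + hi)
               - bisect.bisect_left(sorted_spikes, e + lo)
               for e in events)


def fractional_change(sorted_spikes, events, baseline, response):
    """(response rate - baseline rate) / baseline rate."""
    base = count_in_windows(sorted_spikes, events, baseline) / (baseline[1] - baseline[0])
    resp = count_in_windows(sorted_spikes, events, response) / (response[1] - response[0])
    return (resp - base) / base if base > 0 else float("nan")


def circular_permutation_p(spikes, events, baseline, response,
                           duration, n_perm=500, seed=0):
    """Observed fractional change and its P value from circular shifts.

    Assumption: each permutation shifts every spike by one random offset,
    wrapping at `duration` (the recording length in seconds).
    """
    rng = random.Random(seed)
    observed = fractional_change(sorted(spikes), events, baseline, response)
    extreme = 0
    for _ in range(n_perm):
        shift = rng.uniform(0.0, duration)
        shifted = sorted((s + shift) % duration for s in spikes)
        null = fractional_change(shifted, events, baseline, response)
        if abs(null) >= abs(observed):
            extreme += 1
    return observed, (extreme + 1) / (n_perm + 1)
```

For the 'light' reference event the windows would be `(-3.0, -2.0)` and `(0.0, 1.0)`; for kick events, `(-4.0, -2.0)` and `(-0.5, 0.5)`. Circular shifting keeps each neuron's inter-spike structure intact while breaking its alignment to the events, which is why it is a common choice for this kind of null distribution.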

Extended, counterbalanced open-field test. For data shown in Supplementary Fig. 
3, the open-field test was conducted in an open plastic chamber (50 cm long, 50 cm 
wide and 40 cm deep). Mice were plugged into a patch cord connecting to the external 
portion of the chronically implanted optical fibre, individually placed in the centre of 
the chamber and allowed to freely explore for 30 min. Velocity and distance travelled 
were measured using an automated video-tracking system (Viewer II, BiObserve). 
Measurements began immediately after animals were placed in the chamber. For 
one-half of each group, light stimulation occurred for minutes 0 to 10 and 20 to 30, 
and for the remaining half, light stimulation occurred from minutes 10 to 20. 

In vivo VTA electrophysiology. Adult (>300-g) male Long Evans rats under- 
went either the rat CMS (n = 5) protocol described above or spent equal time in a 
quiet, low-traffic animal housing facility with water and food ad libitum (non- 
CMS; n = 4). In the last week of the treatment, animals were deeply anaesthetized 
and implanted with a 12- or 24-wire array into the VTA (anterior-posterior, −5.8;
lateral-medial, +0.7; dorsal-ventral, −8.2) as described for NAc recording subjects.
After recovery and before recording, subjects were connected to an HS-27
(Neuralynx), returned to the home cage and given 45 min to habituate to the 
tether. After these 45 min, approximately 60-min recordings were carried out in 
the home cage for each animal. After recording experiments, electrolytic lesions 
were created just before animals were euthanized to enable histological identifica- 
tion of electrode-tip location (Supplementary Fig. 10). 

Burst-firing analysis. Bursts were defined as beginning when 2 spikes occurred 
with an inter-spike interval of less than 80 ms and as ending when the inter-spike 
interval was greater than 135 ms, as described previously* (see also refs 33, 34).
Spike sorting was performed using the Offline Sorter (Plexon) and burst analyses 
were calculated using NeuroExplorer (Plexon), with the parameters specified 
above, and the minimum number of spikes in a burst was set at two. 
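
The burst criteria stated above (onset at an inter-spike interval below 80 ms, termination at an interval above 135 ms, at least two spikes per burst) can be written as a short sketch. The analysis in the study was run in NeuroExplorer; the function below is a hypothetical stand-in that assumes spike times are sorted and given in seconds.

```python
def detect_bursts(spike_times, start_isi=0.080, end_isi=0.135):
    """Group sorted spike times (s) into bursts using the two ISI criteria.

    A burst opens when two spikes are less than `start_isi` apart and closes
    at the first interval greater than `end_isi`; every burst therefore
    contains at least two spikes.
    """
    bursts = []
    current = None
    for prev, nxt in zip(spike_times, spike_times[1:]):
        isi = nxt - prev
        if current is None:
            if isi < start_isi:
                current = [prev, nxt]
        elif isi > end_isi:
            bursts.append(current)
            current = None
        else:
            current.append(nxt)
    if current is not None:
        bursts.append(current)
    return bursts
```

The spike that follows a terminating interval is not discarded: it is considered again as a possible first spike of the next burst on the following iteration.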


31. Lemke, M. R., Wendorff, T., Mieth, B., Buhl, K. & Linnemann, M. Spatiotemporal 
gait patterns during over ground locomotion in major depression compared with 
healthy controls. J. Psychiatr. Res. 34, 277-283 (2000). 

32. Ungless, M. A., Magill, P. J. & Bolam, J. P. Uniform inhibition of dopamine neurons in
the ventral tegmental area by aversive stimuli. Science 303, 2040-2042 (2004).

33. Grace, A. A. & Bunney, B. S. The control of firing pattern in nigral dopamine
neurons: burst firing. J. Neurosci. 4, 2877-2890 (1984).

34. Freeman, A. S. & Bunney, B. S. Activity of A9 and A10 dopaminergic neurons in
unrestrained rats: further characterization and effects of apomorphine and 
cholecystokinin. Brain Res. 405, 46-55 (1987). 
Cancer cells acquire distinct metabolic adaptations to survive
stress associated with tumour growth and to satisfy the anabolic 
demands of proliferation. The tumour suppressor protein p53 
(also known as TP53) influences a range of cellular metabolic pro- 
cesses, including glycolysis’*, oxidative phosphorylation’, gluta- 
minolysis*’ and anti-oxidant response’. In contrast to its role in 
promoting apoptosis during DNA-damaging stress, p53 can pro- 
mote cell survival during metabolic stress’, a function that may 
contribute not only to tumour suppression but also to non-cancer- 
associated functions of p53*. Here we show that human cancer cells 
rapidly use exogenous serine and that serine deprivation triggered 
activation of the serine synthesis pathway and rapidly suppressed 
aerobic glycolysis, resulting in an increased flux to the tricarboxylic 
acid cycle. Transient p53-p21 (also known as CDKN1A) activation 
and cell-cycle arrest promoted cell survival by efficiently channel- 
ling depleted serine stores to glutathione synthesis, thus preserving 
cellular anti-oxidant capacity. Cells lacking p53 failed to complete 
the response to serine depletion, resulting in oxidative stress, 
reduced viability and severely impaired proliferation. The role of 
p53 in supporting cancer cell proliferation under serine starvation 
was translated to an in vivo model, indicating that serine depletion 
has a potential role in the treatment of p53-deficient tumours. 

As p53 contributes to the survival of cells deprived of glucose’, we 
investigated whether removal of other nutrients found in normal medium
induced a differential response in p53+/+ and p53−/− HCT116
cells. Whereas removal of the non-essential amino acids serine and 
glycine impaired proliferation of p53+/+ cells, p53−/− cells showed a
more marked loss of proliferation (Fig. 1a) and substantial loss of 
viability (Fig. 1b, c). The contribution of p53 to growth and survival 
during serine and glycine depletion was also seen in RKO cells (Sup- 
plementary Fig. 2a-c) and primary mouse embryonic fibroblasts 
(MEFs; Supplementary Fig. 2d). By removing serine or glycine individually,
we established that serine depletion was the major contributor
to the starvation phenotype (Fig. 1a-c), as removal of glycine
alone had no detrimental effect. Whereas serine and glycine may be 
inter-converted by serine hydroxymethyltransferase (SHMT), serine to
glycine conversion supports proliferation via methyl-tetrahydrofolate 
(THF) production (Supplementary Fig. 1). The reverse reaction (gly- 
cine to serine) depletes methyl-THF, which is presumably why excess 
glycine has been shown to inhibit proliferation”'®. As expected, remo- 
val of lysine (an essential amino acid) did not cause a differential 
response, being equally incompatible with proliferation in p53+/+
and p53−/− cells (Supplementary Fig. 2e).

Analysis of cell culture medium by liquid chromatography-mass
spectrometry (LC-MS) revealed rapid serine consumption by p53+/+
and p53−/− cells (Fig. 1d), whereas glycine uptake was low (Supplementary
Fig. 2f). A recent screen of NCI-60 cancer cell lines showed
that elevated SHMT expression and glycine uptake were correlated with 
rapid proliferation’’. Notably, all 60 lines consumed more serine than 
glycine, including seven lines with the shortest doubling times (<22h), 
which on average consumed 7.7-fold more serine than glycine’’. 
Furthermore, cells with the shortest doubling times and highest glycine 


uptake also consumed most of the available serine", raising the possibi- 
lity that these highly proliferative cells switch to glycine consumption 
because they have exhausted the available serine. Overall this demon- 
strates that cancer cells avidly consume serine, which may, through high 
SHMT expression, be used to generate glycine and methyl-THF. 

Unlike essential amino acids, the chronic depletion of non-essential 
amino acids can be tolerated in vivo. Mice tolerated diets lacking serine 
and glycine well (Supplementary Fig. 3a), and LC-MS confirmed a 
significant drop in serum levels of serine and glycine, but not other 
amino acids (Supplementary Fig. 3b). HCT116 cells rapidly formed 
tumours in animals fed control diet, without a significant difference in 
volume between p53+/+ and p53−/− tumours (Fig. 1e). However,
animals fed matched diet lacking serine and glycine displayed signi- 
ficant reduction in the volume of tumours of both genotypes, and 
survived significantly longer before tumour size or ulceration endpoints
(Fig. 1e, f). As with our in vitro studies, serine and glycine
starvation had a more dramatic effect on p53−/− xenografts, which
had significantly reduced volume compared to p53+/+ tumours in
serine- and glycine-deprived animals (Fig. 1e).

Mammalian cells synthesize serine de novo by channelling the gly- 
colytic intermediate 3-phosphoglycerate into the ‘phosphorylated 
pathway’ of serine synthesis'* (Supplementary Fig. 1). Flux through 
this biosynthetic pathway is controlled primarily by the demand for 
serine, rather than the availability of substrate 3-phosphoglycerate’’. 
The serine synthesis pathway (SSP) supports anabolism by providing 
precursors for biosynthesis of proteins, nucleotides, creatine, porphy- 
rins, phospholipids and glutathione, and SSP upregulation occurs in 
some breast cancers'*"®. A recent study demonstrated that serine star- 
vation activates the SSP'’; we found that serine starvation induced 
strong p53-independent upregulation of PHGDH and PSAT1, with 
a modest increase in PSPH (Fig. 1g and Supplementary Fig. 4a, b). The 
failure of p53−/− cells to proliferate during serine starvation could
not therefore be attributed to a deficiency in SSP enzyme expression. 
p53 has been shown to downregulate PGAM'*—potentially allowing 
3-phosphoglycerate to be channelled to the SSP. However, PGAM 
expression did not vary greatly during serine starvation (Supplemen- 
tary Fig. 4a, b). Consistent with their ability to activate the SSP, both 
p53+/+ and p53−/− cells achieved de novo serine synthesis, as detected
using [U-¹³C]glucose labelling (Fig. 1h). However, p53−/− cells had
lower serine levels, indicating some defect in the ability of these cells to 
adapt to de novo serine synthesis. We therefore sought to explore the 
mechanisms through which cells adapt to serine starvation. 

The mTOR pathway senses amino acid availability, and whereas 
mTORC1 activity was lowered by serine starvation, it was maintained
at very similar levels in p53+/+ and p53−/− cells (Supplementary Fig. 5).
This demonstrated that the effect of serine starvation on mTORC1 was
p53-independent and therefore unlikely to contribute to the enhanced
sensitivity of p53−/− cells. A similar maintenance of mTORC1 activity
in serine-starved cells has recently been shown, and is promoted by 
PKM2 expression”. Serine activates PKM2” and decreased PKM2 acti- 
vity following serine starvation causes an accumulation of upstream 
glycolytic intermediates for diversion to the SSP”. To balance lower 


1The Beatson Institute for Cancer Research, Switchback Road, Glasgow G61 1BD, UK. 
Figure 1 | p53 promotes cell survival and proliferation during serine
starvation in vitro and in vivo. a, HCT116 cells were grown in complete
medium (containing serine and glycine) or equivalent medium lacking these
amino acids (averages of triplicate wells). p53−/− (1ex), deletion of p53 exon 2;
p53−/− (3ex), deletion of p53 exons 2, 3 and 4. b, c, Viability of HCT116 cells was
assessed by analysing sub-G1 DNA content (b, n = 3) and propidium iodide (PI)
exclusion (c, n = 3). d, LC-MS was used to determine the relative consumption
of serine by HCT116 cells fed complete medium (averages of triplicate wells
versus fresh medium). e, Nude mice were subcutaneously injected with HCT116
cells (p53+/+ right flank, p53−/− left flank), and fed diet with or without serine
and glycine. Tumour volume is plotted until the first animal in each group
reached the experimental end-point (*P < 0.05 control diet group versus −Ser &
Gly group; **P < 0.05 for p53+/+ versus p53−/− within −Ser & Gly group).
f, Kaplan-Meier plot of survival until experimental end-point for diet groups
(mean survival; control = 33.3 days (n = 10), −Ser & Gly = 53.2 days (n = 8),
log rank P = 0.001, Wilcoxon P = 0.003). g, Expression of glycolytic and SSP
genes (averages of triplicate quantitative PCR). h, Intracellular serine levels in
HCT116 cells fed medium containing [U-¹³C]glucose were measured by LC-MS
(averages of triplicate wells). Labelled serine (M+3, mass isotopomer with 3 ¹³C
incorporated) and unlabelled serine (M+0, mass isotopomer with 0 ¹³C
incorporated) are shown in cells grown in complete (Com) or serine- and
glycine-deficient (−) medium for the indicated times. All error bars are s.e.m.


glycolysis following PKM2 inhibition, cells increase flux of pyruvate to 
the tricarboxylic acid (TCA) cycle, requiring cells depleted of PKM to 
display increased O2 consumption to support elevated oxidative phosphorylation”.
Both p53+/+ and p53−/− cells displayed elevated phosphoenolpyruvate
(PEP) levels and decreased pyruvate and lactate
levels, evidence of low PKM2 activity following serine starvation 
(Fig. 2a). The importance of oxidative phosphorylation during serine 
starvation was demonstrated by treatment with the mitochondrial ATP 
synthase inhibitor Oligomycin (Fig. 2b), which completely inhibited 
the growth of serine-deprived p53+/+ cells. As p53 supports oxidative
phosphorylation*”!”*, we considered the possibility that p53−/− cells
would be unable to upregulate oxidative phosphorylation in response 
to serine starvation. 
As expected, p53−/− cells showed lower O2 consumption than
p53+/+ cells under fed conditions. Surprisingly, p53−/− cells responded
to serine starvation with increased O2 consumption, whereas
p53+/+ cells showed lower O2 consumption (Fig. 2c). Closer analysis
of metabolic flux into the TCA cycle revealed that whereas both
p53+/+ and p53−/− cells showed an increase in glycolytic TCA cycle
flux in the immediate response to serine starvation, this response was
reversed over time in p53+/+ cells, but sustained in the p53−/− cells
(Fig. 2d, e). This correlated with the changes in O2 consumption
observed during serine starvation, indicating that the initial response
of p53+/+ and p53−/− cells to serine depletion is similar, but p53−/− cells
sustain a metabolic profile indicative of low PKM2 activity, including
elevated O2 consumption.

We predicted that the disruption to glycolysis in the immediate 
response to serine starvation would impede ATP production; indeed,
ATP levels dropped in both p53+/+ and p53−/− cells (Fig. 2f). We
considered that as glycolytic flux to pyruvate is lowered by serine 
starvation, increasing pyruvate levels (by adding exogenous pyruvate) 
would potentially alleviate the ATP shortage by further enhancing 
flux to the TCA cycle. We observed a significant recovery in proliferation
of p53−/− cells after adding pyruvate (Fig. 2g). LC-MS analysis of
Figure 2 | Serine starvation differentially changes energy metabolism in
p53+/+ and p53−/− cells. a, HCT116 cells were fed complete (Com) or
serine- and glycine-deficient (−SG) medium for 24 h, in the presence of
[U-¹³C]glucose for the final 2 h. LC-MS was used to detect relative intracellular
quantities of glycolytic intermediates (averages of triplicate wells). b, HCT116
cells were grown with or without serine, glycine and oligomycin (OL) 1 ng ml−1
(averages of triplicate wells). c, Oxygen consumption rate of HCT116 cells was
measured after 48 h of serine and glycine starvation (n = 3, *P < 0.05).
d, e, Relative intracellular levels of TCA cycle intermediates in HCT116 cells
deprived of serine and glycine (in the constant presence of [U-¹³C]glucose)
were analysed by LC-MS (averages of triplicate wells) (d) and, after long-term
starvation, with [U-¹³C]glucose added for the final hour (averages of triplicate
wells) (e). f, ATP levels were measured in HCT116 cells (n = 3). g, HCT116
p53−/− (1ex) cells were grown in complete medium or medium lacking serine
and glycine with or without pyruvate 5 mM (averages of triplicate wells). All
error bars are s.e.m.
serine-starved p53−/− cells fed unlabelled pyruvate (in the presence of
[U-¹³C]glucose) confirmed increased TCA cycle flux (Supplementary
Fig. 6). However, this enhanced TCA cycle flux was not sufficient to 
fully restore proliferation (Fig. 2g). Importantly, although it has 
recently been shown that serine can be synthesised from pyruvate 
(reviewed in ref. 30), we detected only labelled serine in the cells fed 
unlabelled pyruvate (Supplementary Fig. 6), demonstrating that the 
serine in these cells was derived from glucose. 

Serine contributes to the synthesis of purines via glycine, which 
is incorporated into GMP and AMP via inosine monophosphate 
(IMP). Serine starvation resulted in lower GMP and AMP levels in 
p53+/+ and p53−/− cells (Fig. 3a). Previous studies show that GMP
depletion can activate p53-dependent G1 arrest****. Consistently, we 
found that serine starvation led to a small elevation in p53 expression, 
accompanied by more marked, but transient, elevation of the p53 
target protein p21 (Fig. 3b). Treatment of p53+/+ cells with low doses
of mycophenolic acid (an inhibitor of GMP synthesis) replicated 
this subtle p53-p21 response (Supplementary Fig. 7a, b). Recruit- 
ment of p53 to the p21 promoter during serine starvation was 
Figure 3 | Serine starvation causes recruitment of p53 to the p21 promoter
and activation of a transient p21-dependent G1 arrest. a, LC-MS was used to
quantify total relative amounts of intracellular purine nucleotides GMP and
AMP in serine and glycine fed (Com) and starved (−SG) HCT116 cells
(averages of triplicate wells). b, p53 and p21 protein expression was quantified
in p53+/+ HCT116 cells via western blot and detection with infrared-conjugated
secondary antibodies (n = 3). CDK4 expression was used as a
control for normalisation. c, Chromatin-immunoprecipitation (ChIP) was
performed for p53 with quantitative PCR for two p53-response elements
(−2350 bp and −1450 bp) and the transcription initiation region (+50 bp) of
the p21 promoter (n = 3). HU, hydroxyurea. d, BrdU labelling and PI staining
followed by flow cytometry were used to assess cell cycle (n = 3). e, p21 was
transiently knocked down in p53+/+ HCT116 cells using short interfering RNA
(siRNA, averages of triplicate wells). f, p21−/− HCT116 cells (retaining wild-type
p53) were grown in medium with or without serine and glycine (averages
of triplicate wells). All error bars are s.e.m.
confirmed by chromatin-immunoprecipitation (Fig. 3c). As only a
modest increase in p53 expression was observed, we speculate that 
p53 was activated via post-translational modification, rather than 
MDM2 inhibition. Although our data are consistent with p53 acti- 
vation in response to nucleotide depletion, activation of AMPK 
(Supplementary Fig. 7c) due to lower ATP levels could also contribute 
to the p53 response’. 

As expected following p53 activation, p53+/+ cells starved of serine
and glycine initially showed an accumulation of cells in G1 and
reduced S-phase, correlating with the increased expression of p21
(Fig. 3d). By 48 h, these cells resumed a normal cell cycle. By comparison,
p53−/− cells failed to establish a strong G1 arrest, showing a
more gradual decrease in S-phase (corresponding to an increase in
sub-G1 cells, Fig. 1b) and did not recover normal cell cycle (Fig. 3d).
Because p21 is a major mediator of p53-induced G1 arrest”*, we examined
the effect of p21 loss in p53+/+ cells. p21−/− (CDKN1A−/−) cells
showed a similar cell-cycle response to p53−/− cells (Fig. 3d), indicating
that induction of p21 may be critical for adaptation to serine
starvation. Indeed, p53+/+ cells depleted of p21 by siRNA were unable
to increase in number without serine (similar to p53−/− cells, Fig. 3e), a
response even more notable in cells genetically deleted of p21 (Fig. 3f).

We next considered whether p53+/+ and p53−/− cells use the low
levels of intracellular serine available under conditions of starvation
differently. Specifically, we analysed the balance between purine-nucleotide
and glutathione synthesis, both of which require serine/glycine.
LC-MS showed that after 24 h, p53−/− cells retained flux
of de novo serine/glycine into IMP (M+7), but this was inhibited
in p53+/+ cells (Fig. 4a). Re-feeding serine-starved cells with
[U-¹³C,¹⁵N]L-serine confirmed sustained serine flux into IMP, GMP
and AMP in p53−/− cells (Supplementary Fig. 8a). However, in serine-starved
p53+/+ cells, flux of the replenished labelled serine to
nucleotides was blocked, despite plentiful intracellular serine. This
demonstrated that a feature of p53-p21-induced cell-cycle arrest in
response to serine starvation is inhibition of nucleotide synthesis,
a known function of p21*°’’. Reduced glutathione (GSH) is the principal
cellular anti-oxidant, and we found that levels of GSH dropped
significantly in p53+/+ and p53−/− serine-starved cells (Fig. 4b).
However, in contrast to nucleotides, p53+/+ cells maintained and, over
time, enhanced flux to GSH synthesis during starvation. Notably, this
maintenance of flux to glutathione was not seen in p53−/− and p21−/−
cells (Fig. 4b and Supplementary Fig. 8b). Consequently, total GSH
levels showed recovery in p53+/+ but not p53−/− or p21−/− cells
(Fig. 4c and Supplementary Fig. 8c).

The failure of p53−/− cells to recover GSH levels combined with
their elevated oxygen consumption suggested that these cells would
accumulate increased intracellular reactive oxygen species (ROS)
levels. p53 has well-established anti-oxidant functions’®, but hydrogen
peroxide treatment demonstrated that p53−/− cells were only slightly
more susceptible than p53+/+ cells to oxidative stress under normal conditions
(Fig. 4d). Although serine and glycine starvation increased the
sensitivity of both genotypes to peroxide treatment, this effect was
more marked in p53−/− cells (Fig. 4d), and rescued by adding GSH
(Fig. 4e). Staining with an oxidation-activated fluorescent dye confirmed
increased intracellular ROS in p53−/− cells during serine starvation
(Fig. 4f and Supplementary Fig. 9a). To assess the importance of
ROS in limiting proliferation, we tested the effect of exogenous GSH or
N-acetyl cysteine. Whereas anti-oxidant treatment alone modestly
improved proliferation of serine-starved p53−/− cells, either of the
ROS-limiting treatments almost completely rescued the proliferation
of serine-starved p53−/− cells in combination with pyruvate (Fig. 4g).
We confirmed that the exogenous GSH did not lead to accumulation
of intracellular glycine or serine, demonstrating that rescue was
achieved by increasing the intracellular GSH pool (Supplementary
Fig. 9b). Adding GSH to serine-starved p53+/+ cells did not greatly
enhance proliferation, supporting the theory that p53+/+ cells are able
to maintain GSH pools independently (Supplementary Fig. 9c).
Figure 4 | p53-p21 activation allows serine-deprived cells to synthesize GSH
in preference to nucleotides. a-c, LC-MS was used to detect relative
intracellular quantities of IMP (a) and GSH (b) in HCT116 cells fed complete
medium (Com) or medium lacking serine and glycine (−SG) for 24 h, in the
presence of [U-¹³C]glucose for the final 2 h, and in HCT116 cells fed −SG
medium for the indicated times in the presence of [U-¹³C]glucose for the final
hour (averages of triplicate wells) (c). d, e, HCT116 cells were grown with or


Our data therefore show that the sensitivity of p53−/− cells to serine
depletion is due to a combination of impaired glycolysis and elevated 
ROS. Whereas the initial response of cells with or without p53 to serine 
depletion was similar, p53+/+ cells underwent p21-dependent G1
arrest, allowing the limited levels of de novo serine to be channelled 
to GSH production to counter oxidative stress. Depletion of p21 (while 
retaining p53) caused severe sensitivity to serine depletion (Fig. 3e, f), 
indicating that activation of other arms of the p53 response may con- 
tribute to the death of these cells. Our data also demonstrate that the 
ability of p53-deficient cells to engage higher rates of oxidative phos- 
phorylation is not entirely defective, but that oxidative phosphoryla- 
tion in these cells is probably limited by a requirement to prevent ROS 
generation. This observation ties in with the suggestion that cancer 
cells adopt aerobic glycolysis (the Warburg effect) to avoid generation 
of metabolic ROS”. It has recently been shown that increased ROS 
levels inhibit PKM2”, providing an explanation for why p53−/− cells
show evidence of sustained PKM2 inhibition during serine starvation. 

In conclusion, our study underlines the importance of p53 in coor- 
dinating metabolic remodelling in response to metabolic stress. We 
demonstrate that serine uptake supports the Warburg effect, indi- 
cating that many cancer cells may show some sensitivity to serine 
depletion, particularly those lacking p53. However, it is likely that 
other genetic alterations (such as PHGDH amplification’*"'*) may 
circumvent serine-dependence in other cancer cells. Taken together, 
our work suggests the therapeutic utility of serine depletion—either by 
removal from the diet, enzymatic depletion in vivo, or other means—is 
worthy of further investigation. 


METHODS SUMMARY 


HCT116 p53+/+/p21+/+ (parental), p53−/− (1ex) (deletion of p53 exon 2), p53−/−
(3ex) (deletion of p53 exons 2, 3 and 4), p21−/− and RKO p53+/+ and p53−/− cells
were a gift of B. Vogelstein. Litter-matched p53+/+ and p53−/− MEFs were prepared
from embryonic day 14.5 (E14.5) embryos derived from mating p53+/−
mice. For starvation, cells were initially grown in DMEM (which contains serine
0.4 mM and glycine 0.4 mM); complete medium (Com) was formulated by supplementing
MEM with additional nutrients to closely match the nutrient content
of DMEM. Serine- and glycine-deficient medium (−SG) was formulated in the
same way as complete medium, without addition of serine and glycine. Bilateral
subcutaneous injections of HCT116 cells were carried out on CD-1-Foxn1nu
female mice (Charles River); p53+/+ on right flank and p53−/− (1ex) on the left.
Following injection, mice were placed either on control diet (containing serine and 
glycine as part of the amino-acid mix) or diet deficient in serine and glycine 
(TestDiet, International Product Supplies). The diets had equal calorific value 
and equal total amino acid content. Animals were housed in sterile IVC cages, 


without serine and glycine (d) or medium lacking serine and glycine with 5 mM 
glutathione (GSH) (e) for 48 h in the presence of hydrogen peroxide (H2O2) for
the final 24 h. f, HCT116 cells were treated with an oxidation-activated
fluorescent dye and analysed by flow cytometry. g, HCT116 p53−/− (1ex) cells
were grown with or without serine, glycine, pyruvate 5 mM (Pyr) and/or GSH 
5 mM, or N-acetyl cysteine 0.2 mM (NAC) (averages of triplicate wells). All 
error bars are s.e.m. 


monitored thrice weekly and killed humanely when tumours reached clinical
endpoint of predetermined size (volume = (length × width²)/2) or ulceration.
All animal work was undertaken in line with the UK Animals (Scientific 
Procedures) Act of 1986 and the EU directive 2010. Cell-cycle analysis was per- 
formed by labelling with 5-bromo-2'-deoxyuridine (BrdU) and propidium iodide 
(PI) staining followed by flow cytometry. Oxygen consumption rates were mea- 
sured with an XF-24 extracellular flux analyser (SeaHorse Bioscience). ROS levels 
were detected with CellROX deep red reagent (Invitrogen). ATP levels were 
assayed with a luciferase-based kit. The relative quantities of intracellular metabolites
after labelling with [U-¹³C]-glucose or [U-¹³C,¹⁵N]-serine were analysed
by LC-MS and plotted as the peak area for each metabolite. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Cell culture. Unless otherwise stated, chemicals were obtained from Sigma-Aldrich,
and cell culture reagents from Gibco (Invitrogen). HCT116 p53+/+/p21+/+
(parental), p53−/− (1ex) (deletion of p53 exon 2), p53−/− (3ex) (deletion
of p53 exons 2, 3 and 4), p21−/− and RKO p53+/+ and p53−/− cells were a gift of B.
Vogelstein*'. Litter-matched wild-type and p53−/− MEFs (passage < 4) were prepared
from E14.5 embryos derived from mating p53+/− mice™* (C57Bl6/J background,
>20 gen). Cells were kept at 37 °C in humidified 5% CO₂ in air. Stock
HCT116 and RKO cells were maintained in McCoys 5A medium (26600) contain- 
ing 2mM L-glutamine and 10% FBS (PAA Laboratories). For experiments cells 
were seeded in DMEM (21969) with 10% FBS and 2 mM L-glutamine. Complete 
(Com) medium was formulated to closely match the nutrient composition of 
DMEM (which contains 0.4 mM serine and 0.4 mM glycine); complete medium
consisted of MEM (21090) supplemented with additional 1× MEM vitamins
(11120), 10% dialysed FBS (Hyclone, Thermo Scientific), L-glutamine 2 mM,
additional D-glucose (to 25 mM), serine 0.4 mM and glycine 0.4 mM. For starvation
experiments cells were fed the same medium formulation without serine
and glycine (—SG medium). 

Proliferation assays. HCT116 and RKO cells were seeded in 24-well plates 
(8 × 10⁴ cells per well), MEFs in 12-well plates (p53+/+ 2 × 10⁴ cells per well,
p53−/− 1 × 10° cells per well) in DMEM and allowed to grow for 16–24 h. Cells
were then washed with PBS and received complete or -SG medium supplemented 
with the stated nutrients and/or drugs. To maintain constant nutrient levels and 
remove nutrients liberated from dead cells, medium was replaced every 24h. Cell 
counts were performed with a CASY TT cell counter (Innovatis, Roche Applied 
Science). siRNA for p21 (sc-29427, Santa Cruz Biotechnology, UGUCAGAACC 
GGCUGGGGA and UCCCCAGCCGGUUCUGACA) or non-targeting control 
(si-ctr, ON-TARGETplus, Dharmacon D-001810-10-20, UGGUUUACAUGU 
CGACUAA, UGGUUUACAUGUUGUGUGA, UGGUUUACAUGUUUUCU 
GA and UGGUUUACAUGUUUUCCUA) was transfected with Metafectene SI 
(Biontex) in DMEM for 24 h before washing and adding complete or -SG medium.

Xenografts. Bilateral subcutaneous injections of 3 × 10° HCT116 cells were carried
out on 8-week-old CD-1-Foxn1nu female mice (Charles River); p53+/+ on the right
flank and p53−/− (1ex) on the left. Immediately following injection, mice were
placed either on control diet (n = 10) (containing serine and glycine as part of the 
amino-acid mix) or diet deficient in serine and glycine (n= 10) (TestDiet, 
International Product Supplies), formulations as follows. Control diet ingredients: 
sucrose (25.9%), corn starch (41.8%), corn oil (5.0%), Baker amino acid vitamin 
mix (0.2%), Baker amino acid mineral mix (10.0%), sodium bicarbonate (1.0%), 
DL-alpha tocopheryl acetate (0.004%), ethoxyquin (preservative, 0.019%), choline 
chloride (0.1%), amino acid premix (16.0%). Amino acid pre-mix: L-arginine-HCl
(1.60%), L-cystine (0.64%), L-glutamine (1.60%), glycine (1.33%), L-histidine-HCl
(0.80%), L-isoleucine (1.07%), L-leucine (1.60%), L-lysine-HCl
(1.87%), L-methionine (0.80%), L-phenylalanine (1.07%), L-serine (1.33%), 
L-threonine (1.07%), L-tryptophan (0.27%), L-tyrosine (0.53%), L-valine (1.07%). 
The serine- and glycine-free diet has the same basic formulation as the control diet, 
but the amino acid mix lacks serine and glycine. Serine- and glycine-free diet 
ingredients: sucrose (25.9%), corn starch (41.8%), corn oil (5.0%), Baker amino 
acid vitamin mix (0.2%), Baker amino acid mineral mix (10.0%), sodium bicarbonate
(1.0%), DL-alpha tocopheryl acetate (0.004%), ethoxyquin (preservative,
0.019%), choline chloride (0.1%), amino acid premix (16.0%). Amino acid pre-mix:
L-arginine-HCl (1.60%), L-cystine (0.64%), L-glutamine (1.60%), L-histidine-HCl
(0.96%), L-isoleucine (1.28%), L-leucine (1.92%), L-lysine-HCl (2.24%),
L-methionine (0.96%), L-phenylalanine (1.28%), L-serine, L-threonine (1.28%), 
L-tryptophan (0.32%), L-tyrosine (0.64%), L-valine (1.28%). 

The diets had equal calorific value and equal total amino acid content. Animals 
were housed in sterile IVC cages, monitored thrice weekly and killed humanely 
when tumours reached clinical endpoint of predetermined size (volume = (length
× width²)/2) or ulceration. All animal work was approved by the Ethical Review
Process (University of Glasgow) and undertaken in line with the UK Animals 
(Scientific Procedures) Act of 1986 (PPL 60/4181) and the EU directive 2010. 
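
The endpoint calculation above can be illustrated with a short Python function. This is a minimal sketch rather than code used in the study: the function name `tumour_volume` and the use of millimetres are assumptions, because the text states only the formula volume = (length × width²)/2.

```python
def tumour_volume(length_mm: float, width_mm: float) -> float:
    """Estimate tumour volume as (length x width^2) / 2.

    Units are an assumption (millimetres in, cubic millimetres out);
    the source gives only the formula.
    """
    if length_mm < 0 or width_mm < 0:
        raise ValueError("tumour dimensions must be non-negative")
    return (length_mm * width_mm ** 2) / 2.0


# Hypothetical caliper readings, for illustration only.
print(tumour_volume(10.0, 8.0))  # 320.0
```

Because width enters as a square, a small error in the width measurement affects the estimate more than the same error in length.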
Western blot. Cells were seeded in 6-well plates and grown in DMEM for ~40h. 
Cells were washed with PBS then received complete or -SG medium. Cell numbers 
were titrated at the time of seeding to be ~90% confluent at protein isolation. Cell 
lysates were prepared in RIPA buffer with complete protease inhibitors (Roche), 
resolved via PAGE and transferred to nitrocellulose. Primary antibodies: phospho-p70S6K
(9206) and total-p70S6K (2708), phospho-S6 (2215) and total-S6 (2217),
AMPK-α (2532) and phospho-AMPK-α (2535) all from Cell Signaling
Technology, PHGDH (Sigma Life Science, HPA021241), PSAT1 (Novus
Biologicals, 21020002), PSPH (sc-98683), CDK4 (sc-260), p53 DO-1 (sc-126),
p21 (sc-397) and PGAM1 (sc-130334) all from Santa Cruz Biotechnology.
Secondary antibodies were IRDye800CW- or IRDye680LT-conjugated (LiCor
Biosciences), and were detected using an Odyssey infrared scanner (LiCor
Biosciences) and quantified with Odyssey software.

Oxygen consumption rates (OCR). An XF24 Extracellular Flux Analyser 
(SeaHorse Bioscience) recorded OCR. HCT116 cells were grown in XF-plates 
for 48h in complete or -SG medium (90-100% confluent at measurement). 
OCR was recorded after equilibration and after CCCP (0.5 µM) and antimycin
(1.5 µM) treatments; these values were used to calculate basal and maximal OCR.
All OCR measurements were normalized with well-by-well haemocytometer 
cell counts. 
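
The text does not spell out how basal and maximal OCR were derived from the three readings. The sketch below shows one common convention, stated here as an assumption: the antimycin reading is treated as non-mitochondrial background and subtracted from the equilibrated reading (basal) and from the CCCP reading (maximal), and both are then divided by the cell count for that well. The function name `normalised_ocr` and the example numbers are hypothetical.

```python
def normalised_ocr(equilibrated, after_cccp, after_antimycin, cell_count):
    """Return (basal, maximal) OCR per cell for one well.

    Assumption (not stated in the source): the antimycin reading is
    subtracted as background before normalising to the cell count.
    """
    if cell_count <= 0:
        raise ValueError("cell_count must be positive")
    basal = (equilibrated - after_antimycin) / cell_count
    maximal = (after_cccp - after_antimycin) / cell_count
    return basal, maximal


# Hypothetical readings for a single well, for illustration only.
basal, maximal = normalised_ocr(300.0, 500.0, 100.0, 1.0e5)
print(basal, maximal)
```

Normalising each well to its own count keeps differences in cell number between complete and -SG medium from being read as differences in respiration.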

Peroxide sensitivity assay. HCT116 cells were seeded in 24-well plates in DMEM. 
After 16-20h cells were washed with PBS and received complete or serine- and 
glycine-deficient medium. After 24h the medium was replaced with matching 
medium containing the stated concentrations of hydrogen peroxide; after a further
24 h cells were washed and images captured using a light microscope. We noted
that cell number had a large impact on peroxide sensitivity; therefore cell seeding 
was carefully titrated so that cell number was equal across the different experi- 
mental conditions at the time of peroxide addition. 

Reactive oxygen species detection. CellROX deep red reagent (Invitrogen) was 
added to cell culture medium for 30 min. For flow cytometry, cells were detached 
by washing with PBS-EDTA followed by treatment with PBS-EDTA with trypsin 
(0.025%) and analysed on a FACSCalibur cytometer (BD Bioscience). For confocal 
microscopy, live cells were imaged using an inverted confocal fluorescence micro- 
scope (Fluoview FV1000, Olympus). 

Liquid chromatography-mass spectrometry (LC-MS). HCT116 cells (0.5–1.5 × 10°)
were seeded in triplicate wells of 6-well plates in DMEM; duplicate
plates were seeded for cell counts. After 16–24 h cells were washed with PBS
and received complete or -SG medium for the stated times. For glucose flux
experiments, medium was replaced with HEPES-buffered Krebs-Ringer solution
with 25 mM [U-¹³C]D-glucose (Cambridge Isotopes), 10% dialysed FBS, 1× MEM
amino acids (11130), 2× MEM vitamins and 2 mM L-glutamine. For serine flux
experiments, medium was replaced with complete medium, with serine substituted
for 0.4 mM [U-¹³C,¹⁵N]L-serine (Cambridge Isotopes). Cells were washed
with PBS and metabolites extracted using methanol/acetonitrile/dH2O (5:3:2)
(2–4 × 10° cells per ml). Samples were snap frozen, thawed at 4 °C, spun at
16,000g for 15 min and supernatants collected and filtered through 0.45 µm
PTFE membranes (Millipore). Serum samples were collected at time of killing.
20 µl of serum was added to 980 µl of extraction buffer and prepared as above. LC-MS
analyses were performed on an Orbitrap Exactive (Thermo Scientific) in line
with an Accela autosampler and an Accela 600 pump (Thermo Scientific). The
Exactive operated in the polarity-switching mode with positive voltage 4.5 kV and
negative voltage 3.5 kV. Column hardware consisted of a Sequant ZIC-pHILIC
column (2.1 × 150 mm, 5 µm) coupled to a Sequant ZIC-pHILIC guard column
(2.1 × 20 mm, 5 µm) (Merck). Flow rate was 100 µl min⁻¹; buffers consisted of
acetonitrile (ACN) for A, and 20 mM (NH₄)₂CO₃, 0.1% NH₄OH in H₂O for B.
Gradient ran from 80% to 40% ACN in 20 min, followed by a wash at 20% ACN 
and re-equilibration at 80% ACN. Metabolites were identified and quantified 
using LCquan software (Thermo Scientific). Metabolites were positively identified 
on the basis of exact mass within 5 p.p.m., further validated by concordance with 
standard retention times and plotted as the peak area for each metabolite. 
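
The exact-mass criterion above (agreement within 5 p.p.m.) can be illustrated as follows. This is a minimal sketch and not the LCquan implementation; the function names and the m/z values are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def ppm_error(observed_mz: float, theoretical_mz: float) -> float:
    """Signed mass error in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def within_tolerance(observed_mz: float, theoretical_mz: float,
                     tol_ppm: float = 5.0) -> bool:
    """True if the observed m/z lies within tol_ppm of the theoretical m/z."""
    return abs(ppm_error(observed_mz, theoretical_mz)) <= tol_ppm


# Hypothetical values: an error of about 4 p.p.m. passes a 5 p.p.m. window,
# an error of about 10 p.p.m. does not.
print(within_tolerance(100.0004, 100.0))  # True
print(within_tolerance(100.0010, 100.0))  # False
```

In the workflow described in the text, a mass match alone was not treated as sufficient; identifications were further checked against standard retention times.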
Quantitative PCR. Primers: PGK1 forward: CTGTGGCTTCTGGCATACCT,
PGK1 reverse: CGAGTGACAGCCTCAGCATA; PHGDH forward: ATCTCTCACGGGGGTTGTG,
PHGDH reverse: AGGCTCGCATCAGTGTCC; PSAT1 forward: CGGTCCTGGAATACAAGGTG,
PSAT1 reverse: AACCAAGCCCATGACGTAGA; PSPH forward: GAGCGGACTCCCTTTTAAGC,
PSPH reverse: CAGGGAGGTGAGCTGTGC; SHMT1 forward: CCCTCCCCATTTGAACACT,
SHMT1 reverse: GGGATCCACACTTTTCACTCC; PKM2 forward: CTCGGGCTGAAGGCAGT,
PKM2 reverse: AATTGCAAGTGGTAGATGGCA; actin forward: TCCATCATGAAGTGTGAGG,
actin reverse: TACTCCTGCTTGCTGATCCAC. Reactions used SYBR Green master-mix
on a 7500 Fast Real-Time PCR System (both Applied Biosystems).

Chromatin-immunoprecipitation. Assays were performed as described previously**.
Cells (6–8 × 10°) were seeded in 6-well plates in DMEM, allowed to grow
for ~40 h, then washed with PBS and received complete or -SG medium for 15 h;
hydroxyurea (HU, 0.4 mM) was added as a positive control.

Cell-cycle analysis. For sub-G1 analysis, cells were grown and detached from 
plates as described above, then fixed and stained with propidium iodide (PI, 
50 µg ml⁻¹) for 30 min. For cell-cycle analysis BrdU (Invitrogen) was added to
live cells for 100 min and detected as described previously*’. Flow cytometry was 
performed on a FACSCalibur cytometer. 

PI exclusion. PI was added to cell culture medium (1 µg ml⁻¹) for 5 min. Non-adherent
and adherent cells were collected and analysed by flow cytometry on a
FACSCalibur cytometer. 
ATP assay. Aliquots of cell suspension were added to TE buffer (pH 7.75) heated
to 99°C and assayed with an ATP determination kit (Invitrogen) on a Veritas 
Microplate luminometer (Turner Biosystems). Cell suspensions were counted to 
normalize for cell number. 

Statistics. Survival was assessed by non-parametric distribution analysis (right 
censoring), using log-rank and Wilcoxon tests calculated on Minitab 16 
(Minitab Ltd). The following t-test comparisons were performed with Microsoft
Excel (v12.3.4): tumour volume between diet groups (unpaired, one tail);
tumour volume within diet groups (paired, one tail); serum amino acids
between diet groups (unpaired, one tail); oxygen consumption rate within
genotype (paired, one tail); oxygen consumption rate between genotypes
(unpaired, one tail).
Apoptotic cell clearance by bronchial epithelial cells
critically influences airway inflammation 


Ignacio J. Juncadella, Alexandra Kadl, Ashish K. Sharma, Yun M. Shim, Amelia Hochreiter-Hufford, Larry Borish
& Kodi S. Ravichandran


Lung epithelial cells can influence immune responses to airway 
allergens’*. Airway epithelial cells also undergo apoptosis after 
encountering environmental allergens’; yet, relatively little is known 
about how these are cleared, and their effect on airway inflammation. 
Here we show that airway epithelial cells efficiently engulf apoptotic 
epithelial cells and secrete anti-inflammatory cytokines, dependent 
upon intracellular signalling by the small GTPase Rac1. Inducible
deletion of Rac1 expression specifically in airway epithelial cells in a 
mouse model resulted in defective engulfment by epithelial cells 
and aberrant anti-inflammatory cytokine production. Intranasal 
priming and challenge of these mice with house dust mite extract 
or ovalbumin as allergens led to exacerbated inflammation, aug- 
mented Th2 cytokines and airway hyper-responsiveness, with 
decreased interleukin (IL)-10 in bronchial lavages. Rac1-deficient
epithelial cells produced much higher IL-33 upon allergen or 
apoptotic cell encounter, with increased numbers of nuocyte-like 
cells'*°. Administration of exogenous IL-10 ‘rescued’ the airway 
inflammation phenotype in Rac1-deficient mice, with decreased
IL-33. Collectively, these genetic and functional studies suggest a
new role for Rac1-dependent engulfment by airway epithelial cells
and in establishing the anti-inflammatory environment, and that 
defects in cell clearance in the airways could contribute to inflam- 
matory responses towards common allergens. 

Exposure to environmental pollutants, allergens and pathogens can 
induce apoptosis of airway epithelial cells***’. In fact, clusters of 
uncleared epithelial cells referred to as ‘Creola bodies’ in the sputum
have been seen in patients with asthma for decades*”. Although the 
lung contains professional phagocytes such as macrophages and den- 
dritic cells, many so-called ‘non-professional phagocytes’ can engulf 
apoptotic cells'®''. Because the bronchial epithelial cells come in con- 
tact with allergens first, and greatly outnumber the professional pha- 
gocytes in situ, we asked whether epithelial cells might contribute to 
cell clearance in the airways. 

We tested phagocytosis by primary murine airway epithelial cells, 
the human bronchial epithelial cell line BEAS-2B and the murine alveo- 
lar epithelial cell line MLE-12. To score internalization, we labelled the 
apoptotic epithelial cells with a pH-dependent dye (CypHer5), whose 
fluorescence increased within acidic phagolysosomes'*"* (Fig. 1a, b). 
BEAS-2B, MLE-12 and the primary murine airway epithelial cells 
(CD45− EpCAM+) engulfed apoptotic targets; this was dependent
on phosphatidylserine (PtdSer) recognition on the apoptotic cells 
(inhibitable by annexin V) (Fig. 1b). Apoptotic cell recognition by 
professional phagocytes is known to elicit anti-inflammatory media- 
tors that dampen inflammation in the local tissue environment'*”’. 
Airway epithelial cells also produced active transforming growth factor
(TGF)-β and PGE₂ after apoptotic cell recognition, and dependent
on phosphatidylserine and Rac1 (Fig. 1c, d and Supplementary Fig. 1a,
b). This suggests that airway epithelial cells can engulf apoptotic 
targets and produce anti-inflammatory mediators. 


To test the importance of cell clearance by airway epithelial cells 
in vivo, we wished to delete engulfment genes specifically within epi- 
thelial cells and assess airway inflammation. Because phagocytes 
often express several engulfment receptors, and the relative import- 
ance of particular engulfment receptors on the epithelial cells is not 
fully characterized, we targeted the small GTPase Rac1, which functions
downstream of several engulfment signalling pathways’>"®.
Deletion of Rac1 in the airway epithelial cells was achieved by crossing
Rac1fl/fl mice with Clara cell secretory protein (CCSP)-rtTA/tetO-Cre
transgenic mice, which express Cre under tissue-specific and temporal
control (Fig. 1e and Supplementary Fig. 2a)'*'’. Doxycycline (Dox)
given through drinking water induced Cre expression in the epithelial 
cells of the trachea, bronchi and bronchioles under the promoter for 
the Clara cell-specific secretory protein (CCSP)'* (Supplementary 
Fig. 2b). Dox administration to CCSP-Cre/Rac1fl/fl mice led to Rac1
deletion specifically in the epithelial cells (Fig. 1f, g), but not in other
cells in the bronchial tissue, or in CD45+ haematopoietic cells (Fig. 1f).
Rac1 expression in CCSP-Cre/Rac1fl/fl mice was unaffected before
doxycycline treatment, and Dox treatment of Rac1fl/fl mice did not
affect Rac1 or Rac2 expression.

The doxycycline-inducible system allowed the airways to develop
normally, until Rac1 deletion at 8 weeks. However, because Rac1 is a
key cytoskeletal regulator in many cell types, we were concerned about
potential unintended effects. Several independent lines of evidence
alleviated this concern. First, the architecture and epithelial cell morphology
of the airways were comparable at steady state in control and
Rac1-deleted mice (Supplementary Fig. 2b). Second, numbers of
yellow fluorescent protein (YFP+)-expressing epithelial cells in CCSP-Cre/Rac1fl/fl
mice crossed to Rosa26°'°? **™¥ reporter mice were comparable
to controls (Fig. 1f); importantly, Rac1 expression was deleted
in the YFP+ epithelial cells but not CD45+ YFP− haematopoietic cells
(Fig. 1f and Supplementary Fig. 3c). Third, primary epithelial cell
cultures yielded similar total numbers of cells positive for nitrotetrazolium
blue from control and CCSP-Cre/Rac1fl/fl mice (Supplementary
Fig. 4a). Fourth, the epithelial cell integrity and tight junction formation
were unaffected after Rac1 deletion, on the basis of staining for the
tight-junction protein occludin (Supplementary Fig. 4c, d) and transmission
electron microscopy (Supplementary Fig. 5a). Fifth, integrity
of the alveolar-capillary membrane in CCSP-Cre/Rac1fl/fl mice was intact on
the basis of intravenous injection of Evans blue and tracking the dye in
the lung (Supplementary Fig. 4b). Sixth, the uptake of intranasally
administered fluorescently labelled antigens by epithelial cells, dendritic
cells, and alveolar macrophages was similar between control
and Rac1-deficient mice (Supplementary Fig. 5b, c). Collectively,
Dox-induced deletion of Rac1 in the airway epithelial cells in the
CCSP-Cre/Rac1fl/fl mice did not affect the integrity of the epithelial cells.

Airway epithelial cells from CCSP-Cre/Rac1fl/fl mice showed significant
reduction in phagocytosis of apoptotic cells in vitro (Fig. 1h),
and after intranasal administration of apoptotic cells in vivo 
Figure 1 | Rac1-dependent engulfment and anti-inflammatory cytokine
production by airway epithelial cells. a, Uptake of CypHer5-labelled
apoptotic epithelial cells by viable epithelial cells (CFSE). b, Internalization is
blocked by annexin (anx) V (n = 5). AC, airway epithelial cells. c, TGF-β
(n = 5) and PGE₂ (n = 3) production by BEAS-2B epithelial cells.
d, Phagocytosis and TGF-β production in MLE-12 epithelial cells transfected
with Rac1"!”" (n = 3). e, Schematic for generating CCSP-Cre/Rac1fl/fl mice and
Dox-induced (‘Tet-On’) Rac1 deletion in airway epithelial cells. f, Left, YFP+
epithelial cells from control and Rac1-deficient mice. Right, Rac1 mRNA
expression in epithelial cells (n = 10). g, Loss of Rac1 in the airways of Dox-treated
CCSP-Cre/Rac1fl/fl mice (open arrowheads). h, Left, engulfment by
airway epithelial cells from control and Rac1-deficient mice (n = 5).
**P < 0.01, representative of three experiments. Middle and right, IL-10 and
TGF-β in BAL fluid of control and Rac1-deficient mice after intranasal
administration of apoptotic cells (f, h, n = 5–10 mice from three experiments).
*P < 0.05, **P < 0.01, unpaired Student’s t-test with Welch’s correction
(b, c, d, h), Mann-Whitney test (h). Error bars, s.e.m.


(Supplementary Fig. 6a); in contrast, alveolar macrophages from these
mice showed no phagocytic defect (Supplementary Fig. 6b). Intranasal
administration of apoptotic cells resulted in significantly decreased
amounts of anti-inflammatory cytokines IL-10 and TGF-β in the
CCSP-Cre/Rac1fl/fl mice (Fig. 1h). Because the CCSP-Cre/Rac1fl/fl
mice have normal Rac1 in CD45+ cells (such as macrophages), the
epithelial cells clearly influenced cytokine production in the bronchial
spaces in response to apoptotic cells. Furthermore, in Rac1fl/fl mice
crossed with LysM-Cre mice (Cre expression targeted to the myeloid
lineage)’’, intranasal administration of apoptotic cells did not show a
decrease in IL-10 or TGF-β in the bronchoalveolar lavage (BAL) fluid
(Supplementary Fig. 8b). Thus, mice lacking Rac1 in the airway epithelial
cells showed defective ability to engulf apoptotic cells and
altered inflammatory response in vivo.

We next tested these mice in airway inflammation models. One com- 
mon approach to induce airway inflammation is to prime with antigen 
intraperitoneally and subsequently challenge with the same antigen by 
the intranasal route. We tested the development of allergic inflammation
in CCSP-Cre/Rac1fl/fl mice to low-endotoxin house dust mite
(HDM) extract, a common airway allergen. The control mice showed
T-cell and eosinophil infiltration, along with immunoglobulin-E (IgE)
response and Th2 cytokines IL-4, IL-5 and IL-13, but all these parameters
were heightened in the CCSP-Cre/Rac1fl/fl mice (Supplementary
Fig. 7), suggesting that losing Rac1 expression in airway epithelial
cells exacerbates inflammation. 

Because humans encounter environmental antigens through the 
airways before developing symptoms of airway inflammation, we 
established a system where both the primary allergen encounter and 
secondary challenge were through the intranasal route (Fig. 2a). After 
Dox-induced Rac1 deletion followed by intranasal priming and
challenge with HDM, the CCSP-Cre/Rac1fl/fl mice showed a strong
airway inflammation phenotype with significant accumulation of mucus 
(Fig. 2b). In contrast, controls showed minimal or undetectable airway 
inflammation after intranasal priming and challenge. These control 
conditions included Dox-treated, saline-injected CCSP-Cre/Rac1fl/fl
mice, Dox-treated Rac1fl/fl mice without Cre, CCSP-Cre/Rac1™™
mice and CCSP-Cre/Rac1fl/fl mice that did not receive doxycycline.
Although we often observed more uncleared apoptotic nuclei in allergen-challenged
airways of CCSP-Cre/Rac1fl/fl mice (by TdT-mediated
dUTP nick end labelling and cleaved caspase 3 staining), the staining 
was variable between samples, probably because of mucus and other 
secretions in inflamed tissues (data not shown). 

Allergic airway inflammation involves Th2 responses in the
lungs®”°?!. The CCSP-Cre/Rac1fl/fl mice showed high concentrations
of Th2 cytokines IL-4, IL-5, IL-13, eosinophil and T-cell infiltration
in BAL fluid (Fig. 2c). IL-10 was significantly reduced in the Rac1-deficient
mice (Fig. 2c), correlating with low IL-10 seen in airways of
asthmatic patients’. When re-stimulated with HDM ex vivo, CD4+ T
cells from the mediastinal lymph nodes of CCSP-Cre/Rac1fl/fl mice
produced much higher amounts of IL-4, IL-5 and IL-13 than control
mice (Fig. 2d). When Rac1 deletion in the airway epithelial cells was
controlled (by Dox administration) at either the priming (sensitization)
or challenge phases, keeping Rac1 expression normal at the
priming phase and deleting at the challenge phase showed minimal
airway inflammation to HDM (Supplementary Fig. 9). These data
suggest that Rac1 expression in airway epithelial cells plays a critical
role in preventing sensitization to common airway allergens
encountered by the intranasal route.

When we analysed lung function parameters, HDM-sensitized
CCSP-Cre/Rac1fl/fl mice showed physiological changes consistent with
asthma including much greater airway resistance, decreased compliance
and higher pulmonary arterial pressure (Fig. 2e). Loss of Rac1
led to a twofold increase in methacholine-induced airway hyper-responsiveness
compared with wild-type littermates (Supplementary
Fig. 10). Thus, Rac1 expression in airway epithelial cells critically controls
allergen-induced airway inflammation and the physiological
changes of asthma. Interestingly, intranasal priming and challenge of
the LysM-Cre/Rac1fl/fl mice with HDM did not show enhanced airway
inflammation (Supplementary Fig. 8c, d).

We next asked whether exogenous IL-10 could reverse the inflammatory
phenotype in CCSP-Cre/Rac1fl/fl mice. Recombinant mouse
IL-10 (rIL-10) given at priming and challenge phases with HDM
resulted in markedly reduced airway inflammation, manifested by
reduced production of mucus (Fig. 3a, b), and lower Th2 cytokines
IL-4, IL-5 and IL-13 in the airways. The mediastinal lymph nodes were
substantially smaller in the rIL-10-treated CCSP-Cre/Rac1fl/fl mice,
with a significant decrease in the HDM-induced production of Th2
Figure 2 | Mice lacking Rac1 in airway epithelial cells show increased
allergic inflammation. a, Protocol for inducing airway inflammation.
b, Representative haematoxylin and eosin (H&E) and periodic acid Schiff
(PAS) staining of lung sections from control or CCSP-Cre/Rac1fl/fl mice.
c, Infiltrating CD4+ T cells, eosinophils and Th2-type cytokines in BAL fluid of
control or Rac1-deficient mice (n = 12–18 mice per group from more than four


cytokines (Fig. 3c–e). rIL-10-treated mice also showed improved pulmonary
function (Fig. 3f). This suggests that Rac1 expression in epithelial
cells and IL-10 amounts in the airways are complementary
factors regulating airway inflammation. 

Preparations of HDM are enriched in the type-2-group allergen Der 
p2, an LPS-binding protein®***. To test whether mice lacking Rac1 in
the airway epithelial cells are inherently sensitive to allergen exposure
in the absence of adjuvant activity, we first primed CCSP-Cre/Rac1fl/fl
experiments). d, Cytokine production by mediastinal lymph node (LN) cells re-stimulated
in vitro for 5 days with HDM (n = 8–10 mice per group from three
experiments). e, Airway resistance, compliance and pleural pressure in control
or in Rac1-deficient mice (n = 5 mice from two experiments). *P < 0.05,
**P < 0.01, ***P < 0.001, unpaired Student’s t-test with Welch’s correction
(c–e).


mice with recombinant Der p1, which showed augmented IgE and
IgG1 antibody response and increased inflammation (Supplementary
Fig. 7e, f). Second, although intranasal administration of chicken
ovalbumin (Ova) without LPS often leads to tolerance induction”, in
CCSP-Cre/Rac1fl/fl mice intranasal priming/challenge with Ova led to
airway inflammation (Fig. 4a–d). Control mice showed minimal
inflammation. Mediastinal lymph node cells from Rac1-deficient mice
also produced higher IL-4, IL-5 and IL-13 (Fig. 4e). We confirmed that
control mice primed with Ova plus alum by the intraperitoneal route
and challenged with Ova by the intranasal route showed airway
inflammation (Supplementary Fig. 11). In CCSP-Cre/Rac1fl/fl mice,
the amounts of IL-10 in the BAL fluid were lower after Ova priming
and challenge, and administering rIL-10 significantly reduced this
inflammation (Fig. 4f–h). Thus, Rac1 loss in the bronchial epithelial
cells and reduced IL-10 create a state that is prone to inflammation in
response to normally innocuous inhaled allergens.

Thymic stromal lymphopoietin (TSLP) and IL-33 represent two 
cytokines linked to promoting airway inflammation’**’’, Although 
TSLP did increase in the BAL fluid after HDM challenge, this was 
comparable between control and CCSP-Cre/Rac1fl/fl mice. However,
IL-33 was significantly higher in the CCSP-Cre/Rac1fl/fl mice after allergen
challenge (although basal IL-33 without HDM challenge was not
increased) (Fig. 5a). Treatment with rIL-10 diminished the IL-33 


Figure 3 | Allergic airway inflammation in Rac1-deficient mice is rescued by
rIL-10. a, Schematic of inducing airway inflammation and IL-10 treatment.
b, H&E and PAS staining of lung sections from CCSP-Cre/Rac1fl/fl mice treated
with or without rIL-10 (1 µg), representative of several experiments. c, BAL
fluid analysis showing IL-4, IL-5 and IL-13 production by Rac1-deficient mice
treated with or without rIL-10 (n = 8–10 mice per group, from three
experiments). d, Image of mediastinal lymph nodes (n = 3 mice per group).
e, Cytokine production by lymph node cells re-stimulated in vitro for 5 days
with HDM (n = 5 mice per group). f, Airway resistance, compliance and
pleural pressure in Rac1-deficient mice treated with or without rIL-10 (n = 5)
(d–f, representative of three experiments). *P < 0.05, unpaired Student’s t-test
with Welch’s correction (c, e, f).
amounts in the BAL fluid (Fig. 5a). IL-33 has been implicated in the
pathogenesis of many diseases including asthma”. Recently, IL-33 has 
been shown to activate and expand a non-T/non-B population of innate 
lymphoid cells (‘nuocytes’), linked to promoting airway inflammation?”°.
After HDM priming, lungs from CCSP-Cre/Rac1fl/fl mice




Figure 4 | Airway inflammation with Ova after Rac1 deletion. a, Schematic
of intranasal priming and challenge with Ova. b, H&E staining of lung sections
from Dox-treated control and CCSP-Cre/Rac1fl/fl mice (×20 magnification).
c, Infiltrating CD4+ T cells and eosinophils in BAL fluid of indicated mice.
d, Th2-cytokines in the BAL fluid. (c, d, n = 10 mice per group). e, Cytokine
production by mediastinal lymph node T cells re-stimulated in vitro for 5 days
with Ova (n = 6 mice per group). (c–e, Representative of three experiments).
f–h, Rac1-deficient mice primed and challenged with Ova with IL-10. f, H&E
staining of lung sections. Arrowheads indicate leukocyte infiltration (×20
magnification). g, Image of mediastinal lymph nodes (f, g, representative of
three experiments). h, BAL fluid analysis of infiltrating T-cells, eosinophils and
Th2 cytokines (n = 6 mice combined from two experiments). *P < 0.05,
**P < 0.01, unpaired Student’s t-test with Welch’s correction.


contained significantly higher IL-5-expressing cell populations resembling
nuocytes (lineage−, T1/ST2+ CD44", Sca-1+, ICOS+) (Fig. 5b).
Interestingly, after HDM priming of CCSP-Cre/Rac1fl/fl mice (less than 6
days), the lung nuocytes (but not CD4+ T cells) were positive for IL-5
(Supplementary Fig. 12a, b), suggesting inducible nuocyte activation. A
neutralizing IL-33 antibody reduced the inflammation in the CCSP-Cre/Rac1fl/fl
mice, with reduction in lymph node cellularity, infiltrating
eosinophils and activated CD4+ T cells (Supplementary Fig. 12c, d).
Although the function of this lung nuocyte-like population needs further
testing, these data suggest a link between enhanced IL-33 production
in the CCSP-Cre/Rac1fl/fl mice after allergen challenge, the expansion
of an IL-5-producing nuocyte-like population and airway inflammation.

Because epithelial cells are a main source of IL-33, we asked how
Rac1 affects IL-33 expression. When MLE-12 epithelial cells were
stimulated with HDM or apoptotic cells, IL-33 messenger RNA
(mRNA) was upregulated; drug-mediated inhibition of Rac1 (ref. 30)
or short interfering RNA (siRNA) knockdown of Rac1 led to a three- to
sixfold increase in IL-33 (Supplementary Fig. 13 and Fig. 5c). This IL-33
mRNA upregulation due to Rac inhibition was not a generic effect, as
the inhibitor greatly diminished Rac1-dependent TGF-β expression
(Supplementary Fig. 13b).

To test whether Rac1 deletion affects IL-33 expression in primary
airway epithelial cells during allergen challenge, we used the
CCSP-Cre/Rac1fl/fl mice crossed to the Rosa26°'°” ¥¥¥? reporter mice.

Figure 5 | IL-33 upregulation in Rac1-deficient epithelial cells correlates
with airway inflammation. a, IL-33 and TSLP in BAL fluid of control or Rac1-
deficient mice. b, Strategy for gating nuocyte-like cells and their numbers in
lungs after HDM priming (a, b, n = 6 or 7 mice from two experiments). c, IL-33
or TGF-β expression in Rac1-siRNA-treated MLE-12 cells with apoptotic cells
(n = 3) (c, representative of three experiments). d, e, In vivo IL-33 induction
after intranasal HDM or apoptotic cells in CCSP-Cre/Rosa26YFP mice (d), or in
purified YFP⁺ EpCam⁺ epithelial cells (n = 5 or 6 mice, from three
experiments) (e). f, IL-33 mRNA in human nasal epithelial cells stimulated with
HDM or apoptotic cells (n = 3); *P < 0.05, **P < 0.01, unpaired Student’s
t-test with Welch’s correction. Error bars, s.e.m.

550 | NATURE | VOL 493 | 24 JANUARY 2013

Although the YFP⁺ EpCAM⁺ airway epithelial cells from control
and CCSP-Cre/Rac1fl/fl mice had comparable amounts of basal IL-33
message, this was significantly increased after HDM stimulation.
Second, in the BAL fluid of Rac1-deficient mice, IL-33 cytokine was
increased three- to fivefold after intranasal priming with HDM,
whereas the TSLP amounts were comparable (Fig. 5d, e). Third, after
intranasal injection of apoptotic epithelial cells, purified YFP⁺
EpCAM⁺ epithelial cells from Rac1-deficient mice had much higher
IL-33 mRNA (Fig. 5e). This link between Rac1 and IL-33 expression
was also seen in human nasal epithelial cells, as production of IL-33
by HDM or apoptotic cells was strongly enhanced by Rac inhibition
(Fig. 5f). Collectively, these data suggest a critical role for
Rac1 expression in epithelial cells in regulating IL-33 expression
and, in turn, the sensitization to inhaled allergens.

Although airway epithelial cells are the first to come in contact with 
the allergens, their contribution to tolerance towards common airway 
allergens has been less clear. The data presented here suggest that 
airway epithelial cells can be phagocytic, and that their ability to 
secrete/modulate cytokines can critically influence tolerance to com- 
mon airway allergens. Thus, apart from a physical barrier, phagocy- 
tosis in the airways may be part of an extra line of immune protection 
against innocuous antigens. These data also identify a new Rac1-dependent
IL-33:IL-10 axis in controlling airway inflammation. The amounts
of the pro-inflammatory cytokine IL-33 and the anti-inflammatory
cytokine IL-10 seem linked, with Rac1 expression in airway epithelial
cells controlling both steps. Although asthmatic patients have deficits
in IL-10 (ref. 22), the contribution of IL-10 to allergen sensitization
and its potential therapeutic use remain unexplored. Our studies
suggest IL-10 as a critical regulator of airway inflammation, and 
IL-10 might prove a new therapeutic approach to limiting airway 
inflammation. Lastly, although cell clearance and anti-inflammatory
cytokine production by professional phagocytes have long been
studied, understanding of the relative importance of cell clearance by
non-professional phagocytes under physiological settings is still limited.
These data suggest that airway epithelial cells can both be phagocytic and
modulate the cytokine milieu to limit airway inflammation.


METHODS SUMMARY 

Mice. Rac1fl/fl mice were crossed with CCSP-rtTA/tetO-Cre mice and Rac1 deletion
was induced by doxycycline (1 mg ml⁻¹) in drinking water. To track the Rac1-deleted
airway epithelial cells specifically, the CCSP-Cre/Rac1fl/fl mice were crossed with the
Rosa26Stop-EYFP reporter mice (JaxMice).

Induction of airway inflammation. Mice were primed intranasally with low-
endotoxin HDM extract (10 µg in 50 µl volume, Indoor Biotechnologies), or
chicken Ova (20 µg) on days 0, 3, 5. Control mice received saline during the
priming and challenge phases. Mice were challenged intranasally on days 10, 12
and 14, and analysed for Th2 cell-dependent airway inflammation on day 16.
Control or CCSP-Cre/Rac1fl/fl mice were treated intranasally for 24 h with 10 µg
of HDM or 5 million to 10 million apoptotic ultraviolet-treated Jurkat cells or
etoposide-treated (100 µM) BEAS-2B cells. BAL fluid was collected (24 h) and
cytokine production analysed by enzyme-linked immunosorbent assay (ELISA).
In vitro engulfment assay. MLE-12, BEAS-2B cells or primary airway epithelial
cell cultures were plated at a density of 3 X 10° in a 12-well plate and incubated
with 3 X 10° CypHer5 or TAMRA-labelled apoptotic Jurkat T cells or etoposide
(100 µM)-treated apoptotic BEAS-2B cells for 4 h.


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS


Mice. Mice carrying floxed alleles for Rac1 were crossed with mice carrying the Cre
recombinase driven by the rat CCSP promoter (CCSP-rtTA/tetO-Cre). Rac1 deletion
in CCSP-Cre/Rac1fl/fl mice was achieved by administering doxycycline (1 mg
ml⁻¹) in drinking water containing 0.4% sucrose. The various control mice
included CCSP-Cre/Rac1fl/fl mice that received no doxycycline, Rac1fl/fl
mice (no Cre) that received doxycycline, doxycycline-treated CCSP-Cre/
Rac1fl/+ heterozygous mice or CCSP-Cre/Rac1wt/wt mice. To facilitate flow
cytometric isolation of those airway epithelial cells with Rac1 deletion, the CCSP-Cre/
Rac1fl/fl mice were further crossed with the Rosa26Stop-EYFP reporter mice (The Jackson
Laboratory). The Rac1fl/fl mice were also crossed with either the LysM-Cre strain
to achieve deletion of Rac1 in the myeloid lineage, or the CD11c-Cre mice to
achieve deletion in the dendritic cell lineage. The University of Virginia Animal 
Care and Use Committee approved all animal experiments. 

Induction of airway inflammation. Mice were primed intranasally with 50 µl of
low-endotoxin preparation of the HDM extract (10 µg, Indoor Biotechnologies) or
with chicken Ova (20 µg, Sigma-Aldrich) on days 0, 3, 5. Control mice received
saline during the priming and challenge phases. Mice were then challenged intra- 
nasally on days 10, 12 and 14. On day 16, the mice were analysed for Th2 cell- 
dependent eosinophilic airway inflammation. Alternatively, mice were primed 
with HDM, Der p1 or Ova by intraperitoneal injection with Alum (2.25 mg, 
Pierce) on days 0, 3, 5, followed by intranasal challenge on days 10, 12 and 14 
and analysed for inflammation. 

Collection of BAL fluid and total cell numbers. BAL was performed by injecting 
1 ml of phosphate-buffered saline (PBS) containing 0.01 mM EDTA through a 
cannula, flushing twice and collecting 500-700 µl. The BAL fluid was centrifuged
and the cells were washed with PBS containing 1% FBS. The cells were stained for 
monocytes/macrophages, neutrophils, T cells, alveolar macrophages and eosino- 
phils using the following antibodies: anti-mouse Ly6G, Ly6C, F4/80, CD11b (for 
macrophages and neutrophils), CD4, CD25, CD44, CD69 (for T cells), CD11c (for 
dendritic cells) and Siglec F (BD Biosciences) (for eosinophils and alveolar
macrophages). Absolute cell numbers were determined by adding reference
particles (Spherotech) during flow cytometry (Supplementary Fig. 14). Data
were collected on a FACS Canto (Becton Dickinson) and analysed with FlowJo
software (Tree Star). The BAL fluid was stored at −80 °C and further used in
analysis of cytokines.
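
The reference-particle step above amounts to a simple proportion, which can be sketched as follows. This is only an illustration of the general bead-based counting calculation, assuming a known number of particles is added to each sample; the function name and the example numbers are hypothetical and are not taken from the study.

```python
def absolute_cell_count(cell_events, bead_events, beads_added):
    """Estimate the number of cells in a sample from flow cytometry events.

    cell_events: events falling in the gate for the cell type of interest
    bead_events: reference-particle events acquired in the same run
    beads_added: number of reference particles spiked into the sample (assumed known)
    """
    if bead_events <= 0:
        raise ValueError("no reference particles were acquired")
    # The fraction of beads acquired estimates the fraction of the sample acquired,
    # so the gated events are scaled up by beads_added / bead_events.
    return cell_events * beads_added / bead_events


# Hypothetical run: 12,000 gated cells and 5,000 of 50,000 added beads acquired.
estimate = absolute_cell_count(12_000, 5_000, 50_000)
```

Counting relative to the beads makes the estimate independent of how much of each tube the cytometer happened to acquire.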

Cytokine analysis. IL-4, IL-5, IL-13, IL-10, IL-33 and TSLP were measured in
the BAL fluid by ELISA (using antibodies from R&D Systems) 48 h after
the last challenge. HDM-specific IgE and IgG1 were measured in the sera and BAL
fluid of mice sensitized and challenged with HDM by ELISA (antibodies from
BD Biosciences). HDM or Ova-specific CD4⁺ T-cell responses were measured by
re-stimulating 5 X 10° cells from mediastinal lymph nodes for 5 days with 10 µg of
HDM or Ova in the presence of 5 X 10° mitomycin C-treated splenocytes.

Microscopy and histology. The expression of CCSP, Rac1 and the Cre recombinase
was determined by immunofluorescence staining. An optimal cutting temperature
compound (O.C.T., Sakura Finetek) was injected into the lungs through a
tracheal cannula. The lungs were excised and ‘snap frozen’ with liquid nitrogen
and stored at −80 °C. Frozen sections (0.8 µm) were cut and stained with primary
antibody to CCSP (1:50, Santa Cruz Biotechnology), Rac1 (1:20, Cytoskeleton)
and Cre, using a FITC- (fluorescein isothiocyanate) labelled antibody (1:500,
BioLegend) for 30 min at room temperature. The sections were washed three times
with PBS and stained for 30 min at room temperature with anti-goat Alexa Fluor®
647 and anti-rabbit Alexa Fluor® 594 secondary antibodies (1:500, Invitrogen).
The nuclei were counterstained with DAPI (4’,6-diamidino-2-phenylindole)
(Invitrogen). The samples were analysed using a Zeiss AxioImager Z2 equipped
with Apotome for optical sectioning (Zeiss).

To assess the amount of inflammation and mucus secretion, lungs were
fixed in 10% neutral-buffered formalin and paraffin-embedded, and 0.8 µm sections
were stained with H&E and PAS, respectively. All staining procedures were
performed by the histology core facility at the University of Virginia.

Airway epithelial cell isolation and culture. For primary airway epithelial cell 
purification and culture, lungs were perfused through the right ventricle with 
buffered saline and carefully excised. Two millilitres of dispase (BD Biosciences)
were instilled into the lungs, followed by 1% low-melting agarose (Invitrogen) and
incubated for 1 h at 37 °C. After incubation, the lungs were minced and mechanically
disrupted by passing them through 100 and 70 µm nylon strainers. Single-
cell suspensions were washed with PBS containing 1% FBS and 100 units of
DNase I and treated with red blood cell lysis buffer (Sigma-Aldrich) for 5 min. The
cells were then washed, centrifuged, and CD45⁺ cells were depleted from the
single-cell suspensions by incubating with anti-CD45 microbeads and passing
through a MACS® Cell Separation Column (Miltenyi Biotec). The cells were
re-suspended in DMEM/F-12 media containing 10% FBS and plated in a 10 cm
culture dish at 37 °C for 2 h. After incubation, the non-adherent cells were
collected, centrifuged and seeded on a 10 cm culture plate coated with calfskin
collagen type I (Sigma-Aldrich).

Lung function analysis. Lung function was measured using an in situ buffer- 
perfused mouse lung system. Animals were anaesthetized and positive-pressure 
(2 cmH₂O) ventilation (120 breaths per minute, tidal volume) was initiated. The
lungs were perfused with Krebs-Henseleit buffer, containing 2% albumin, 0.1% 
glucose and 0.3% HEPES. The lungs were allowed to equilibrate for 5 min and 
pulmonary compliance, resistance and pulmonary arterial pressure were mea- 
sured and recorded using Pulmodyn data-acquisition software (Hugo Sachs 
Electronik). 

Airway hyper-responsiveness. Airway hyper-responsiveness was measured by 
exposing mice to increasing doses of aerosolized methacholine (0-10 mg ml⁻¹).
A tracheotomy tube was inserted and then connected to the inspiratory and
expiratory ports of a volume-cycled ventilator (flexiVent; SCIREQ Scientific).
The data from normal ventilation were collected to establish the baseline values
for each animal. The change in airway resistance (ΔR) from baseline was calculated
after each dose.
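
The baseline subtraction described above can be sketched in a few lines. This is a minimal illustration, assuming one baseline resistance value per animal and one resistance reading per methacholine dose; the function name, units and values are hypothetical and are not measurements from the study.

```python
def resistance_change(baseline, resistance_by_dose):
    """Return the change in airway resistance from baseline for each dose.

    baseline: resistance recorded during normal ventilation for one animal
    resistance_by_dose: mapping of methacholine dose (mg/ml) to measured resistance
    """
    return {dose: value - baseline for dose, value in resistance_by_dose.items()}


# Hypothetical readings for a single animal (arbitrary resistance units).
changes = resistance_change(0.5, {0: 0.5, 5: 0.9, 10: 1.7})
```

Expressing each reading relative to the animal's own baseline lets dose-response curves from different animals be compared directly.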

Transmission electron microscopy. Lungs from wild-type or Racl-deficient 
mice were perfused with buffered saline and fixed in 4% paraformaldehyde solu- 
tion containing 2% glutaraldehyde for 24h. Lung sections were prepared and 
analysed by the University of Virginia Advanced Microscopy Facility.

In vitro engulfment assay. MLE-12, BEAS-2B cells or primary airway epithelial 
cell cultures were plated at a density of 3 X 10° in a 12-well plate and incubated 
with 3 X 10° CypHer5 or TAMRA-labelled apoptotic Jurkat T cells or etoposide-treated
(100 µM) apoptotic BEAS-2B cells for 4 h. In some experiments, viable BEAS-
2B cells were stained with CFSE (2.5 µM) before incubation with apoptotic cells.
Phagocytosis was analysed using a BD FACS Canto (Becton Dickinson). To block
phagocytosis, 1 µM of cytochalasin D (Sigma-Aldrich) or 10 µg ml⁻¹ of purified
human annexin-V (Enzo Life Sciences) was used. In some experiments the specific
Rac1 inhibitor EHT 1864 (30 µM, Sigma-Aldrich) was used.

In vivo engulfment assay. CCSP-Cre/Racl™" mice and control/Racl™ mice 
crossed with Rosa26Stop-EYFP reporter mice (and therefore having Cre-marked
airway epithelial cells) were injected intranasally with 10 X 10° CypHer-labelled
apoptotic thymocytes for 3 h. The lungs were then minced, digested with collagenase
type II (183 U ml⁻¹) for 1 h and passed through a 70 µm strainer. Single-cell
suspensions were gated on YFP⁺ cells and engulfment of apoptotic cells was
analysed. Alveolar macrophages were collected from BAL fluid and stained with 
anti-CD11b, anti-F4/80 antibodies and analysed for uptake of CypHer-labelled 
apoptotic cells. 

In vivo treatment with HDM or apoptotic cells. Control or CCSP-Cre/Rac1fl/fl
mice were treated intranasally for 24 h with 10 µg of HDM or 5 million to
10 million apoptotic ultraviolet-treated Jurkat cells or etoposide-treated (100 µM)
BEAS-2B cells. After 24 h, BAL fluid was collected and the cytokine production was
analysed by ELISA. 

Vascular permeability. Vascular permeability in the lungs was determined using 
the Evans blue dye extravasation method. Evans blue (20 mg kg⁻¹, Sigma-Aldrich)
was injected intravenously for 1 h before the mice were killed. The mice were then
perfused with 10 ml PBS to remove intravascular dye. The lungs were then homogenized
in PBS to extract the dye and centrifuged. The absorbance of Evans blue
was measured in the supernatants at 620 nm, and corrected for the presence of
haem pigments as follows: A620 (corrected) = A620 − (1.426 × A740 + 0.030).
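
The haem correction quoted above is a one-line calculation, sketched here in Python. The sketch assumes that the second absorbance reading in the formula is taken at 740 nm, as in the commonly used form of this correction; the function name and the example readings are hypothetical and are not data from the study.

```python
def corrected_a620(a620, a740):
    """Correct an Evans blue absorbance reading at 620 nm for haem pigments.

    Implements A620(corrected) = A620 - (1.426 * A740 + 0.030), where a740 is
    assumed to be the absorbance of the same supernatant at 740 nm.
    """
    return a620 - (1.426 * a740 + 0.030)


# Hypothetical readings for one lung supernatant.
example = corrected_a620(0.500, 0.100)
```

The corrected value, rather than the raw reading at 620 nm, is what would then be compared between groups.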

Antigen uptake assay. To assess uptake of soluble antigen by dendritic cells, alveolar
macrophages or epithelial cells, 10 µg of fluorescently labelled HDM or Ova
(Alexa Fluor® 594) was injected intranasally for 1 h. Total lungs were minced
and treated with collagenase or dispase, single-cell suspensions were prepared and 
stained for dendritic cells, alveolar macrophages and epithelial cells, and analysed 
for the percentage of antigen uptake. 

Identification of nuocyte-like cells in the lung. Total lung suspensions were 
prepared and gated on lineage-negative populations by staining with the following 
cocktail of antibodies: mouse anti-CD3, CD4, CD8, B220, CD11c, CD11b, CD49b, 
Ly6C/Ly6G. These cells were further gated on the basis of T1/ST2 expression 
(mouse biotin-conjugated anti-T1/ST2 antibody (clone DJ8 1:50 dilution), 
BioLegend), CD44 and Sca-1 (1:100 dilution, eBioscience). These nuocyte-like 
cells were further confirmed by ICOS surface expression (anti-ICOS antibody, 
eBioscience). Intracellular IL-5 expression was analysed by intracellular staining 
using anti-mouse IL-5 antibody (eBioscience). 

Treatment with anti-IL-33 neutralizing antibody. CCSP-Cre/Rac1fl/fl or control
mice were intranasally primed and challenged with HDM (10 µg) in the presence
of anti-IgG (isotype control, 6 µg per dose) or anti-IL-33 (6 µg per dose) antibody
(R&D Systems). After the last challenge, the mediastinal lymph nodes were excised 
and total cellularity was analysed. BAL fluid was collected and stained for activated 
T cells and eosinophils. 

Transfection with siRNA. MLE-12 cells were transfected with a nucleofector,
with 250 nM of siRNA targeting Rac1 or a control non-targeting SMARTpool
(Thermo Scientific, catalogue number L-041170-00-0005), using the recommended
protocol for primary mammalian epithelial cells (Lonza).

Quantitative PCR analysis. Total RNA was extracted from epithelial cells using
the Qiagen Qiashredder and RNeasy kit, and cDNA was synthesized using
SuperScript III kit (Invitrogen). Quantitative PCR for Rac1, Rac2, TGF-β and
IL-33 was performed using the TaqMan Gene Expression Assay (Applied
Biosystems), and run on a StepOnePlus instrument (ABI). It is noteworthy
that the higher IL-33 measured was not derived from the apoptotic cells as
species-specific primers were used (Supplementary Fig. 13b).

Statistical analysis. Statistical significance for individual data points was deter- 
mined by an unpaired Student’s two-tailed t-test, unless noted otherwise. A P value 
less than 0.05 was considered statistically significant. 
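
As a minimal sketch of the unpaired two-sample comparison described above, the following Python computes the t statistic and the Welch-Satterthwaite degrees of freedom for the unequal-variance (Welch) form of the test, which is the variant named in the figure legends. The function name and the example values are hypothetical and are not data from this study; converting t to a two-tailed P value requires a t-distribution routine, which is deliberately left out here.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for an unpaired two-sample t-test with Welch's correction."""
    na, nb = len(a), len(b)
    if na < 2 or nb < 2:
        raise ValueError("each group needs at least two observations")
    # Squared standard error of each group mean, from the sample (n - 1) variance.
    sa, sb = variance(a) / na, variance(b) / nb
    t = (mean(a) - mean(b)) / sqrt(sa + sb)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (sa + sb) ** 2 / (sa ** 2 / (na - 1) + sb ** 2 / (nb - 1))
    return t, df


# Hypothetical measurements for two groups of five animals each.
t_stat, dof = welch_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
```

Unlike the pooled-variance Student's test, this form does not assume that the two groups share a common variance, which is why the degrees of freedom are generally not an integer.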

Functional and evolutionary insight from the crystal
structure of rubella virus protein E1

Little is known about the three-dimensional organization of rubella
virus, which causes a relatively mild measles-like disease in children
but leads to serious congenital health problems when contracted in
utero. Although rubella virus belongs to the same family as the
mosquito-borne alphaviruses, in many respects it is more similar
to other aerosol-transmitted human viruses such as the agents of
measles and mumps. Although the use of the triple MMR (measles,
mumps and rubella) live vaccine has limited its incidence in western
countries, congenital rubella syndrome remains an important
health problem in the developing world. Here we report the 1.8 Å
resolution crystal structure of envelope glycoprotein E1, the main
antigen and sole target of neutralizing antibodies against rubella
virus. E1 is the main player during entry into target cells owing to its
receptor-binding and membrane-fusion functions. The structure
reveals the epitope and the neutralization mechanism of an important
category of protecting antibodies against rubella infection. It
also shows that rubella virus E1 is a class II fusion protein, which had
hitherto only been structurally characterized for the arthropod-borne
alphaviruses and flaviviruses. In addition, rubella virus E1
has an extensive membrane-fusion surface that includes a metal site,
reminiscent of the T-cell immunoglobulin and mucin family of
cellular proteins that bind phosphatidylserine lipids at the plasma
membrane of cells undergoing apoptosis. Such features have not
been seen in any fusion protein crystallized so far. Structural comparisons
show that the class II fusion proteins from alphaviruses
and flaviviruses, despite belonging to different virus families, are
closer to each other than they are to rubella virus E1. This suggests
that the constraints on arboviruses imposed by alternating cycles
between vertebrates and arthropods resulted in more conservative
evolution. By contrast, in the absence of this constraint, the strictly
human rubella virus seems to have drifted considerably into a
unique niche as sole member of the Rubivirus genus.

Figure 1 | Negative-stain electron microscopy observations of E1e trimers
in isolation and bound to liposomes. a, Isolated E1e trimers tend to line up at
the air–water interface in the electron microscope grid, with their three-fold
axes perpendicular to the plane of the paper. b, c, Rubella virus E1e covers the
liposome surface forming an ordered hexagonal lattice, viewed from the top
(b) and from the side (c). Note the clear three-fold symmetry of the lattice
building blocks. Scale bars, 100 nm. Arrows mark regions that are magnified
in insets.

The human pathogenic rubella virus is the cause of German measles,
a highly contagious childhood airborne disease that is endemic
throughout the world. The virus efficiently crosses the placental barrier
when it infects pregnant women, causing miscarriage or serious congenital
disease in newborns. Rubella belongs to the Togaviridae family
of positive-strand RNA viruses. Although it has the same genome
organization as the alphaviruses (the only other genus in this family),
the corresponding structural proteins show no detectable sequence
similarity. Rubella virus has two envelope glycoproteins, E1 and E2
(481 and 282 amino acids long, respectively), which are anchored in
the viral membrane by a carboxy-terminal transmembrane segment.
E1 carries the principal antigenic determinants and is responsible for
receptor recognition and low-pH-triggered membrane fusion upon
internalization through receptor-mediated endocytosis. E1 was
shown to bind myelin oligodendrocyte glycoprotein to gain entry into
certain cells, which may account for the serious demyelinating symptoms
observed in some patients with congenital rubella syndrome.

For structural studies we produced the rubella virus E1 ectodomain
(termed E1e, Supplementary Fig. 1) in Drosophila S2 cells, as described
in Methods. Secreted E1e forms trimers that bind to liposomes (Fig. 1),
on which they form hexagonal arrays similar to those formed by their
alphavirus and flavivirus counterparts in the post-fusion form, suggesting
that recombinant E1e is in the post-fusion conformation. The
crystal structure (Fig. 2 and Supplementary Table 1) indeed revealed
clear structural homology to viral class II fusion proteins (Supplementary
Fig. 2 and Table 1), featuring three β-sheet-rich domains termed I,
II and III (Fig. 2). The E1e subunits of the trimer adopt the typical
post-fusion hairpin conformation.

Comparisons with the alphavirus and flavivirus class II fusion protein
structures (Supplementary Figs 2 and 3a), together with electron
micrographs of E1e on liposomes (Fig. 1b, c), unambiguously define
the membrane-facing (‘top’) side and the membrane-distal (‘bottom’)
end of trimeric E1e. Domain III is swapped in the E1e trimer, occupying
the space taken up by domain III from the neighbouring subunit
in the alphavirus and flavivirus class II post-fusion trimers (Supplementary
Fig. 2). The ‘stem’, which connects domain III to the
transmembrane anchor, is clearly visible until six residues before the
transmembrane segment, ending next to the fusion loops in a region
that would be in the lipid-head-contact area in the full-length transmembrane
protein (Fig. 2, Supplementary Fig. 2 and Supplementary
Video 1). The conformation of the stem had not been observed before,
because this segment was absent in the structures of the alphavirus and
flavivirus fusion proteins.

Using the Dali server (ref. 11), comparisons with other protein structures
revealed that rubella virus E1 appears equally distant from its alphavirus
and flavivirus counterparts (Table 1), which are structurally closer
to each other despite belonging to different virus families. The main
differences are concentrated in the fusion-loop-containing domain II.
In rubella virus E1, domain II is larger because of the presence of three
important insertions in its membrane-proximal half, a region termed
‘tip’ (Fig. 2 and Supplementary Figs 2 and 3). The overall effect is that
E1e displays a considerably larger surface for interaction with lipids, as
highlighted in Supplementary Fig. 2b. Instead of a fusion loop, which
in the alphavirus and flavivirus fusion proteins maps to the connection
between strands c and d (the cd loop), rubella virus E1 has an insertion
of almost 40 residues in between these two strands (‘Insert 2’,
Supplementary Fig. 3). These extra residues make up a ‘fusion surface’,
so named by analogy to the fusion loop (see Supplementary Video 1),
which in the context of the trimer displays a continuous area of about
8,000 Å² (Supplementary Fig. 2b). The short amphipathic helix α2 is
positioned such that it projects a tyrosine side chain per turn towards
the membrane (Tyr 108 and Tyr 112), and the 3₁₀ helical turn η1
projects Tyr 101 (Fig. 2b, c; see also Supplementary Fig. 4 for secondary
structure nomenclature). In total, the fusion surface projects out 15
aromatic side chains (mainly tyrosines), also including Tyr 95 and
Phe 123 from each subunit, clustered around the three-fold molecular
axis. The ij loop also reaches the membrane-contact region, exposing
the side chain of Ile 280 at the top (Fig. 2b, c). At the centre, the fusion
surface is sealed by Pro 104 of the three subunits interacting with each
other at the three-fold axis (Fig. 2c). In line with the stabilizing role of
this ‘proline seal’, the P104G mutation was shown to result in a 1-log
reduction in rubella virus infectivity.

The polypeptide segment forming most of the fusion surface is 
framed at its amino- and carboxy-terminal ends by two loops, termed 
fusion loops 1 and 2 (FL1 and FL2, residues 88-93 and 131-137, 
respectively; see Supplementary Fig. 4), which project towards the 
membrane (Supplementary Fig. 2). The aromatic residues Tyr 90 in 
FL1 and Trp 131 and Phe 133 in FL2 are at the very top of the molecule, 
but the side chains are oriented towards the protein with the main- 
chain amino and carbonyl groups exposed at the very top, together 
with the side-chain hydroxyl groups of Ser 91, Ser 92 and Thr 135
(Fig. 2b, c). These loops may adopt a different conformation to expose 


Figure 2 | Crystal structure of rubella virus E1e. a-c, Linear (a) and three-dimensional
(b, c) diagrams of E1e colour-coded by domain, with disulphide
bonds in green. DI-DIII denote domains I to III (see Supplementary 
Information for a description of class II domains), with the stem (S; sequence 
displayed in a) in magenta and the DI-DII linker in cyan. FL1 and FL2 denote 
fusion loops 1 and 2. In a, the transmembrane (TM) and cytoplasmic (C) 
segments are indicated in white. Grey fonts in the stem sequence mark residues 
absent from the structure. Framed section denotes the beginning of the 
transmembrane segment. Black triangles denote O-linked glycan sites. Arrows 
point to the metal sites (blue spheres). Aromatic residues at the membrane- 
contact area are shown as ball-and-stick; the 1, 2, 3, 4 arc denotes tyrosine 
residues 95, 101, 112 and 108 in one subunit. The trimer axis is shown in red in 
b (side view) and c (top view, red dot). Domain II insertions in rubella virus E1 
(see Supplementary Fig. 3) are coloured grey in one subunit in c. d, Surface 
representations viewed from top, side and bottom, coloured as above. Glycan 
atoms are shown as silver spheres. 

Table 1 | Dali scores comparing class II viral fusion proteins

Rows A-D and columns 1-4 denote, in order: rubella virus E1* (A, 1), alphavirus E1 (B, 2), flavivirus E (C, 3) and class III viral fusion proteins (D, 4).

Cell  Comparison      Z     n/nt     σ (Å)  %
A1    A:B†            49.5  421/424  0.4    100
A1    Ca²⁺            48.7  424/431  0.5    100
A2    SFV             9.4   278/391  7.2    10
A3    TBEV            9.4   268/387  7.1    10
A3    DEN2            8.9   264/380  7.3    9
A3    DEN1            8.6   266/379  7.6    10
A4    VSVG            1.9   74/409   3.5    9
A4    HSV1gB          1.2   43/581   3.3    9
B2    A:B†            49.3  389/391  0.7    100
B2    SFV:CHV‡        37.7  376/390  3.0    62
B3    TBEV            14.2  325/382  4.7    8
B3    DEN2            13.3  320/380  5.1    8
B3    DEN1            13.7  324/379  5.2    10
B3    TBEV:CHV‡       10.9  311/395  7.0    9
B4    VSVG            0.1   102/409  5.9    10
B4    HSV1gB          0.5   39/576   6.9    3
C3    TBEV A:B†       50.2  387/388  0.8    100
C3    DEN1:DEN2       41.1  372/379  1.0    68
C3    TBEV:DEN2       35.3  369/380  2.0    40
C3    TBEV:DEN1       34.4  369/379  1.9    38
C4    HSV1gB          0.3   35/379   4.3    0
D4    VSVG A:B†       46.0  409/410  1.0    100
D4    HSV1:EBVgB      27.4  474/576  4.3    28
D4    HSV1gB:gp64     5.7   107/418  4.5    14
D4    VSVG:gp64       5.3   211/409  9.5    4

Cells B1, C1, C2, D1, D2 and D3 refer to A2, A3, B3, A4, B4 and C4, respectively.

Each row and column includes viruses from the same genus, except for row D and column 4, which include class III protein controls from viruses of different families or subfamilies. The table also gives the variation range: Z scores under 2 are meaningless, whereas Z ~ 50 corresponds to the same protein in different crystalline environments. n/nt indicates the number (n) of aligned residues compared to the total residues in the alignment (nt). σ indicates the root mean square deviation (in angstroms) between backbone α-carbon atoms. % indicates the percentage amino acid identity after structural alignment of the proteins (see ‘Evolutionary pace’ in Supplementary Information). Unless otherwise specified, the comparisons in rows B and C involve the post-fusion form of SFV E1 (row B) and the post-fusion forms of the flavivirus E proteins (row C). The data to the left and the right of the diagonal are identical.

*All comparisons using rubella virus E1e were done with coordinates in which domain III was manually ‘unswapped’ to make a trimer organized in the same way as alphavirus E1 or flavivirus E (Supplementary Fig. 2). Without this operation, the Dali score was based on fewer residues and the Z-score value was poorer.

†A:B denotes chain A over chain B within the same trimer, in cases where the chains are in different crystalline environments. Ca²⁺ indicates a comparison between Ca²⁺-bound and Na⁺-bound E1e.

‡This comparison is between pre-fusion forms: SFV:CHV refers to pre-fusion E1 from SFV compared to pre-fusion E1 from CHV. TBEV:CHV denotes a comparison between the TBEV E protein and the CHV E1 protein, both in pre-fusion form.

CHV, chikungunya virus; DENn, dengue virus serotype n; EBVgB, Epstein-Barr virus glycoprotein B; gp64, baculovirus glycoprotein 64 (ref. 30); HSV1gB, herpes simplex virus type 1 glycoprotein B; SFV, Semliki forest virus; TBEV, tick-borne encephalitis virus; VSVG, vesicular stomatitis virus glycoprotein.


the aromatic side chains during interactions with the membrane. We 
indeed identified some degree of conformational flexibility in FL2 in 
the various crystal forms analysed (see below). The functional importance
of this region is also highlighted by the G93D mutation in FL1,
which resulted in a 3-log reduction in virus infectivity.

A metal ion is coordinated by Asn 88 in FL1 and Asp 136 in FL2, 
through main- and side-chain atoms (Figs 2 and 3). The metal site is a 
unique feature of rubella virus E1, as there is no metal in the other class 
II fusion proteins, nor in the fusion peptide/loop of any other viral 
fusion protein reported to date. It is reminiscent of the metal-ion- 
dependent ligand-binding site of the T-cell immunoglobulin and 
mucin (TIM) family proteins’* shown in Supplementary Fig. 5. TIM 
proteins specifically bind phosphatidylserine lipids on membranes. As 
described in Supplementary Information, the Ele metal site can bind 
Na* or Ca** ions, the latter with higher affinity. We found that the 
conformation of FL2 depends on the metal bound (Fig. 3, Supplemen- 
tary Fig. 5 and Supplementary Video 2). Also, liposome flotation 
assays showed that in the presence of Ca** Ele binds membranes at 
both acidic and neutral pH, whereas in its absence it only binds at 
acidic pH (Fig. 3). Given that the extracellular environment is in gene- 
ral at neutral pH with Ca?* concentrations in the millimolar range”, 
this in vitro behaviour may have biological relevance, suggesting that 
Ca?*-bound rubella virus E1 could serve for virion attachment directly 
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to the cell membrane. A role for El binding to lipids at neutral pH 
would be in line with reports indicating that certain lipids inhibit 
rubella virus infection’*. As discussed in Supplementary Information, 
the electrostatic potential at the membrane-fusion surface switches 
from neutral to basic at the centre at low pH (Supplementary Fig. 6). 
This feature is independent of the nature of the metal bound, and may 
also be related to the type of membrane interactions of E1 in internal 
acidic compartments of the cell, some of which display negatively 
charged lipids on the luminal side (for instance, late endosomes).
An important neutralizing epitope has been mapped to amino acids
223-239 of rubella virus E1 (ref. 17), a segment that includes disulph-
ide bond 8 (DB8; see Supplementary Fig. 4). Antibodies binding to this
epitope are abundant in rubella-virus-positive individuals, and their
presence is used as a correlate for protection against viral disease.
Serum from congenital rubella syndrome patients who are persistently
infected despite having high levels of rubella-virus-specific antibodies
has little or no reactive antibody against the DB8 epitope.
Although the epitope is clearly accessible on infectious virions, the
structure shows that it is completely buried at the centre of the E1e
trimer (Supplementary Fig. 7), so that it becomes inaccessible upon E1
trimerization for fusion. The structure therefore predicts that anti-
bodies bound to this site would block E1 trimerization for entry,
thereby revealing the neutralization mechanism, although we cannot
rule out an additional block by the antibodies at a stage before endo-
somal uptake. As discussed in Supplementary Information, the loca-
tion of this exposed epitope on E1 also provides clues about the way the
molecule may be presented at the surface of infectious virions
(Supplementary Fig. 8). In particular, it suggests that, unlike alpha-
viruses and flaviviruses, mature rubella virions may not display a rigid
glycoprotein shell covering the viral membrane, in line with very
recent electron cryo-tomographic studies on mature rubella virions.

Figure 3 | Metal and pH dependence of liposome binding. a, b, Close-up of
the metal site occupied by Na⁺ (blue sphere; a) or Ca²⁺ (green sphere; b) ions,
coordinated between FL1 and FL2, as seen in our structures at pH 8.0. Note the
change in Asp 136 (grey arrow; see Supplementary Video 2) and the bound
acetate ion in b. c, d, Liposome binding at pH 5.5 (c) or 7.5 (d) as determined by
co-flotation in sucrose gradients. Numbers 1, 2 and 3 correspond to different
lipid compositions (see Methods) in the absence (EDTA) or presence (Ca²⁺) of
calcium ions. Top (T), middle (M1, M2) and bottom (B) fractions were
analysed by western blot.

The structural comparisons in Table 1 show that the fusion glycopro-
teins encoded by viruses of the two genera (Alphavirus and Rubivirus)
present in the Togaviridae family are much more distant from each other
structurally than are their counterparts from different subfamilies in the
Herpesviridae family, for example. Also, despite belonging to different
viral families, the alphavirus and flavivirus envelope proteins seem to
have undergone a more conservative evolution (Table 1), perhaps
because of stronger constraints exerted on viruses alternating between
arthropod and vertebrate hosts. Infecting only humans, rubella virus
thus seems to have evolved further to occupy a distant niche. A similar
process may have occurred with the strictly human hepatitis C virus in
the Flaviviridae family.

In summary, the structure of rubella virus E1 revealed a class II 
fusion protein that has evolved a number of specific features, placing 
it apart from the class II fusion proteins of known structure. In par- 
ticular, its unique fusion surface (Supplementary Video 1) has not been 
described for any other viral fusion protein. The ability of E1e to form
two-dimensional crystalline arrays on membranes (Fig. 1) now offers 
the opportunity to study the membrane-bound conformation of the 
fusion surface by electron crystallography. Comparative structural 
studies such as this one, revealing both specific elements in each virus 
and functional parallels across families, may help to identify common 
essential features that could constitute potential Achilles heels against 
which to direct antiviral strategies. 


METHODS SUMMARY 


Rubella virus E1e was cloned and produced as a secreted protein from Drosophila
melanogaster S2 cells as explained in Methods. Purified E1e, alone or co-floating
with liposomes in a sucrose gradient, was negatively stained with 2% (w/v)
uranyl acetate and examined in a Philips CM12 microscope operated at 120 kV. Images were
recorded in low-dose mode at a nominal magnification of ×42,000 in a KeenView
Soft Imaging System.

E1e crystals grew overnight in 1-3% polyethylene glycol 4000, 0.1 M HEPES
buffer, pH 7.5-8.0, and 30% glycerol. Heavy-atom derivative crystals were obtained
by soaking crystals overnight in 8% polyethylene glycol 4000, 0.1 M HEPES, pH 7.5,
30% glycerol and 10 mM potassium hexachlororhenate. Diffraction data were col-
lected at 100 K at synchrotron sources to 1.78 and 3.40 Å resolution for native and
rhenium derivative crystals, respectively. The structure was determined and refined 
as described in Methods. All data collection and refinement statistics are listed in 
Supplementary Table 1, and the lipid-head-groups tested in co-crystallization trials 
are listed in Supplementary Table 2. 


Full Methods and any associated references are available in the online version of
the paper.

METHODS

E1e production. Codon-optimized synthetic cDNA corresponding to rubella
virus envelope glycoproteins E2 and E1 was cloned into a modified pMT plasmid
engineered for production of secreted protein from Drosophila melanogaster S2
cells. E2 is believed to chaperone the correct folding of E1. The amino-acid
sequence corresponds to strain M33 (Swiss-Prot accession number P08563) from
position 301 (the N terminus of E2) to position 1018 (that is, before the E1
transmembrane segment) of the polyprotein precursor (Supplementary Fig. 1a).
The pMT plasmid was modified to add C-terminal residues:
FEDDDDKAGWSHPQFEKGGGSGGGSGGGSWSHPQFEK (containing an enterokinase
cleavage site, DDDDK, and two Strep-tactin purification tags, WSHPQFEK). The plasmid
contains a metallothionein promoter for induction of gene expression. For trans-
location into the endoplasmic reticulum lumen, the plasmid encodes the signal
peptide of the Drosophila endoplasmic reticulum-resident binding immuno-
globulin protein immediately upstream of E2.

The resulting E1e expression plasmid along with a hygromycin resistance plas-
mid (pCoHygro) was used to obtain stably transfected S2 cells. Cells were grown to
~1 × 10⁷ cells ml⁻¹ in Insect-XPRESS medium and expression was induced with
500 µM CuSO4. After 3 days, 100-fold concentrated S2 media supernatant was
supplemented with 10 µg ml⁻¹ avidin and 0.1 M Tris buffer, pH 8.0, and E1e was
purified on a Strep-tactin affinity column. E1e was further purified by Superdex
200 16/60 size-exclusion column in 10 mM Tris, pH 8.0, and 150 mM NaCl.
Trimeric E1e was dialysed overnight into 10 mM Tris, pH 8.0, with no NaCl
and concentrated to 2.0 mg ml⁻¹.

E1e structure determination. Crystals were grown overnight in 1-3% polyethyl-
ene glycol (PEG) 4000, 0.1 M HEPES buffer, pH 7.5, and 30% glycerol. Heavy-
atom derivative crystals were obtained by soaking overnight in 8% PEG 4000,
0.1 M HEPES, pH 7.5, 30% glycerol and 10 mM K2ReCl6. Crystals were plunged
directly into liquid nitrogen. Diffraction data were collected at the Swiss Light
Source and at SOLEIL synchrotrons. Because of clear damage at some disulphide
bonds, the final data for refinement were collected with low beam intensity at
cryogenic temperature on the Pilatus detector at the PX I beam line of the Swiss
Light Source synchrotron and were processed with XDS. Two single-wavelength
anomalous dispersion (SAD) data sets were collected from derivative crystal Re-1
and one SAD data set was collected from derivative crystal Re-2 using the inverse
beam method at the peak of the rhenium absorption edge (1.1762 Å). SCALA
combined the three SAD data sets and showed anomalous signal to 4.6 Å resolu-
tion. Location of eight rhenium sites was performed using SHELXC/D within
autoSHARP. Eight additional sites were found with SHARP. The figure of merit
was 0.21 for initial phases calculated from a total of 12 rhenium sites. Solvent
flattening with SOLOMON gave initial SAD phases. PROFESS identified nine
rhenium sites (out of the 12 found) with three-fold symmetry and LSQKAB
located the three-fold non-crystallographic axis. Density modification was per-
formed with 1.78 Å native data using DM and the figure of merit increased to
0.80. An initial model was built with ARP/wARP and refinement and rebuilding
were carried out with BUSTER and Coot, respectively. In BUSTER, the local
structural similarity restraints and the autoncs method were applied. Refinement
was monitored following the Rfree value calculated for a random subset (10%) of
reflections omitted from refinement. Data collection and refinement statistics are
listed in Supplementary Table 1. The programs SCALA, PROFESS, LSQKAB and
DM are part of the CCP4 programs suite. Supplementary Fig. 4c shows the final
electron density in one of the regions of poorest quality, the linker between
domains I and III.
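
The Rfree statistic used to monitor refinement is conventionally the crystallographic R factor, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|, evaluated only over reflections that were withheld from refinement. The sketch below illustrates that idea in Python. It is not the BUSTER implementation: the function names, the fixed random seed and the assumption that observed and calculated amplitudes are already on a common scale are all illustrative.

```python
import random


def r_factor(f_obs, f_calc):
    """Conventional R factor: sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|)."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def pick_free_set(n_reflections, fraction=0.10, seed=0):
    """Choose a random subset of reflection indices to omit from refinement.

    The 10% default mirrors the fraction quoted in the text; the seed is an
    arbitrary choice made here so that the selection is reproducible.
    """
    rng = random.Random(seed)
    n_free = max(1, round(n_reflections * fraction))
    return set(rng.sample(range(n_reflections), n_free))


def r_work_and_r_free(f_obs, f_calc, free_set):
    """Compute R over the working set and over the withheld (free) set."""
    work = [i for i in range(len(f_obs)) if i not in free_set]
    free = sorted(free_set)
    r_work = r_factor([f_obs[i] for i in work], [f_calc[i] for i in work])
    r_free = r_factor([f_obs[i] for i in free], [f_calc[i] for i in free])
    return r_work, r_free
```

Because the free reflections never influence the model, an Rfree that rises while the working R falls is a common warning sign of over-fitting.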

Metal site analysis. On the basis of several structural validation criteria and 
further crystallization studies described in Supplementary Information, we iden-
tified that the Na⁺ had been replaced by Ca²⁺.

Liposome flotation experiments. Liposomes were freshly prepared by the freeze-
thaw and extrusion method. DOPS (1,2-dioleoyl-sn-glycero-3-phospho-L-serine),
DOPE (1,2-dioleoyl-sn-glycero-3-phosphoethanolamine), DOPC (1,2-dioleoyl-
sn-glycero-3-phosphocholine), cholesterol and sphingomyelin were purchased
from Avanti Polar Lipids. Type 1, 2 and 3 liposomes were formed by molar ratios
of 1/1/2 of DOPE/DOPS/DOPC, 1/1/2/2 of DOPE/DOPS/DOPC/cholesterol, and
1/1/2/2/2 of DOPE/DOPS/DOPC/cholesterol/sphingomyelin. E1e (1 mM) and
liposomes (1 mM) were incubated overnight at 30 °C in the absence (1 mM
EDTA) or presence (10 mM Ca(CH3COO)2) of calcium ion at pH 5.5 (10 mM
sodium phosphate buffer) and pH 7.5 (10 mM Tris-Cl) in 100 µl. To keep the
amount of sodium ions constant at each pH, 10 mM NaCl was added to the samples
with Tris. Samples were then mixed with an equal volume of 40% sucrose, loaded
into a 5-40% sucrose gradient (at pH 5.5 or 7.5) and centrifuged overnight at 40,000
r.p.m. at 4 °C in a SW 55 Ti rotor. Fractions from top, middle and bottom regions of
the gradient were analysed by western blot using an anti-Strep-tag antibody.
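
The molar ratios quoted for the three liposome types can be converted into mole fractions, which makes it easier to see how the share of phosphatidylserine (DOPS) changes between compositions. The following Python sketch only restates the ratios given above; the dictionary layout and the function name are illustrative choices.

```python
# Molar ratios of the three liposome types as stated in the text.
LIPOSOMES = {
    "type 1": {"DOPE": 1, "DOPS": 1, "DOPC": 2},
    "type 2": {"DOPE": 1, "DOPS": 1, "DOPC": 2, "cholesterol": 2},
    "type 3": {"DOPE": 1, "DOPS": 1, "DOPC": 2, "cholesterol": 2,
               "sphingomyelin": 2},
}


def mole_fractions(ratio):
    """Convert molar parts into mole fractions that sum to 1."""
    total = sum(ratio.values())
    return {lipid: parts / total for lipid, parts in ratio.items()}


if __name__ == "__main__":
    for name, ratio in LIPOSOMES.items():
        fractions = mole_fractions(ratio)
        summary = ", ".join(f"{lipid} {value:.3f}" for lipid, value in fractions.items())
        print(f"{name}: {summary}")
```

Under these ratios DOPS makes up one quarter of type 1, one sixth of type 2 and one eighth of type 3 liposomes.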
Electron microscopy. Samples were negatively stained with 2% (w/v) uranyl
acetate and screened in a Philips CM12 microscope operating at an accelerating
voltage of 120 kV. Images were recorded in low-dose mode at a nominal mag-
nification of ×42,000 in a KeenView Soft Imaging System.

RNAi triggered by specialized machinery silences
developmental genes and retrotransposons


Soichiro Yamanaka, Sameet Mehta, Francisca E. Reyes-Turcu, Fanglei Zhuang, Ryan T. Fuchs, Yikang Rong, Gregory B. Robb
& Shiv I. S. Grewal


RNA interference (RNAi) is a conserved mechanism in which small 
interfering RNAs (siRNAs) guide the degradation of cognate RNAs, 
but also promote heterochromatin assembly at repetitive DNA ele-
ments such as centromeric repeats. However, the full extent of
RNAi functions and its endogenous targets have not been explored. 
Here we show that, in the fission yeast Schizosaccharomyces pombe, 
RNAi and heterochromatin factors cooperate to silence diverse 
loci, including sexual differentiation genes, genes encoding trans- 
membrane proteins, and retrotransposons that are also targeted by 
the exosome RNA degradation machinery. In the absence of the 
exosome, transcripts are processed preferentially by the RNAi 
machinery, revealing siRNA clusters and a corresponding increase 
in heterochromatin modifications across large domains containing 
genes and retrotransposons. We show that the generation of siRNAs
and heterochromatin assembly by RNAi are triggered by a mech-
anism involving the canonical poly(A) polymerase Pla1 and an
associated RNA surveillance factor Red1, which also activate the 
exosome. Notably, siRNA production and heterochromatin modi- 
fications at these target loci are regulated by environmental growth 
conditions, and by developmental signals that induce gene express- 
ion during sexual differentiation. Our analyses uncover an inter- 
action between RNAi and the exosome that is conserved in 
Drosophila, and show that differentiation signals modulate RNAi 
silencing to regulate developmental genes. 

In S. pombe, centromeric transcripts are processed by RNAi factors
including the Argonaute (Ago1)-containing RNA-induced transcrip-
tional silencing (RITS) complex, the RNA-dependent RNA poly-
merase (Rdp1) and Dicer (Dcr1). Processing of transcripts into
siRNAs is coupled to loading of Clr4, a homologue of mammalian
SUV39H, required for histone H3 lysine 9 methylation (H3K9me).
Whereas most siRNAs correspond to centromeric repeats, siRNAs
that match other repetitive structures such as Tf2 retrotransposons
have not previously been detected.

The exosome, whose 3'-5' exonuclease activity also processes
heterochromatic transcripts, acts in an overlapping manner with RNAi
to silence centromeric transcripts, suggesting that RNAi and the
exosome share targets. Recent studies have revealed widespread tran-
scription of the genome, producing transcripts that are regulated
by the exosome. Under specific growth or developmental conditions,
some of these transcripts may also feed into the RNAi pathway to 
generate siRNAs for heterochromatin assembly. Such RNAi targets 
might be detected in an exosome mutant, in which transcripts would 
be processed preferentially by RNAi machinery. 

We performed high-throughput sequencing of small RNAs in cells 
lacking the exosome subunit Rrp6 to identify RNAi targets throughout 
the genome. We observed small RNA clusters mapping to various 
genomic locations. Equal densities of small RNA reads match forward 
and reverse strands of specific loci, and are abolished in RNAi mutants 
(Supplementary Table 1 and Figs 1 and 2). Overlapping transcription 
at convergent genes could induce RNAi, but siRNA clusters are also
observed at non-convergent loci, indicating an alternative mechanism. 
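
The criterion described above, clusters of small RNA reads that match the forward and reverse strands at comparable densities, can be illustrated with a short Python sketch. It assumes reads have already been mapped and are supplied as (position, strand) pairs. The function name, the fixed window size and both thresholds are hypothetical values chosen for illustration and are not parameters reported in the study.

```python
from collections import defaultdict


def strand_balanced_windows(reads, window=1000, min_reads=20, max_ratio=2.0):
    """Return start coordinates of windows whose reads cover both strands evenly.

    reads: iterable of (position, strand) pairs, strand being '+' or '-'.
    window, min_reads and max_ratio are illustrative thresholds (assumptions).
    """
    counts = defaultdict(lambda: {"+": 0, "-": 0})
    for position, strand in reads:
        # Assign each read to the fixed-size window containing its position.
        counts[(position // window) * window][strand] += 1

    balanced = []
    for start in sorted(counts):
        forward, reverse = counts[start]["+"], counts[start]["-"]
        # Skip sparse windows and windows with reads on one strand only.
        if forward + reverse < min_reads or min(forward, reverse) == 0:
            continue
        # Keep windows where neither strand dominates by more than max_ratio.
        if max(forward, reverse) / min(forward, reverse) <= max_ratio:
            balanced.append(start)
    return balanced
```

Running the same function on reads from an RNAi mutant and checking that the windows disappear would mirror the dependence on RNAi described in the text.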


Previously unknown RNAi targets include loci in subtelomeric 
regions as well as protein-coding genes, and siRNA clusters correlate 
with H3K9me enrichment (Supplementary Fig. 1 and Supplementary 
Table 1). H3K9me peaks are dependent on RNAi (Figs 1 and 2), and 
are distinct from heterochromatin islands that can be assembled 
without RNAi. Closer examination of heterochromatin domains
(HOODs) formed by RNAi shows that H3K9me and siRNA clusters
are preferentially targeted to genes upregulated during sexual differ-
entiation, as well as genes encoding transmembrane domain proteins
(Supplementary Table 1). Several siRNA clusters overlap non-coding
RNA (ncRNA; Supplementary Table 1), which could potentially
trigger RNAi.

Figure 1 | siRNA clusters and RNAi-dependent assembly of
heterochromatin at gene-containing regions. a, The normalized numbers of
small RNA reads are plotted. The signals above and below the line represent
small RNAs that map to the top and bottom DNA strands, respectively. siRNAs
mapping to myp2 and SPCC1442.04c (1442.04c) and their length profiles are
shown. nt, nucleotides. b, Chromatin immunoprecipitation (ChIP) showing
Chp1 enrichment at the indicated loci. The intensities of bands representing
each locus and the leu1 control in ChIP and whole-cell crude extract (WCE)
lanes were used to calculate the relative fold enrichments shown. c, Northern blot
with probes specific to myp2 or SPCC1442.04c was used to detect siRNAs.
Transfer RNA serves as a loading control. d, H3K9me2 was mapped by
ChIP-chip. siRNA clusters and H3K9me map to the open reading frames of
multiple genes, but their distribution decreases sharply, in some cases coinciding
with transcription factor TFIIIC-binding sites, such as tRNAs, that are known to
serve as heterochromatin boundaries. e, Detection of antisense transcripts
generated from myp2 or SPCC1442.04c loci was performed using PCR with
reverse transcription (RT-PCR). The act1 transcript level was used as a control.
RT(+) and RT(−), reverse transcriptase present or absent, respectively.

Figure 2 | Tf2 retrotransposons produce siRNAs and show RNAi-
dependent heterochromatin formation. a, The small RNA reads that map to
the region containing Tf2-5 in the indicated strains are plotted. b, The
H3K9me2 distribution was determined by ChIP-chip. c, The length profile of
small RNAs mapping to the antisense strand of Tf2 is shown. The inset graph
shows the frequency of the base at the 5' end of reads that map to Tf2.
d, Northern blot with probes specific for Tf2 was used to detect siRNAs. tRNA
serves as a loading control. e, Effects of Clr4 and Rdp1 on siRNAs mapping to
Tf2. The small reads mapping to Tf2-5 in the indicated strains are plotted.
f, Analysis of Clr4 and Rdp1 requirement for production of Tf2 siRNAs. Results
of the northern blot analysis using the indicated strains are shown.

Small RNAs that map to genes are predominantly 20-24 nucleotides
long (Fig. 1a), consistent with RITS-associated siRNAs. Indeed, RITS
subunit Chp1 localized to these loci (Fig. 1b). Small RNA production
also requires Ago1 (Fig. 1a). The remaining reads in ago1Δ rrp6Δ
display a broad length distribution consistent with degradation pro-
ducts (Supplementary Fig. 2). Ago1-dependent production of siRNA
from myp2 and SPCC1442.04c was confirmed by northern blot
(Fig. 1c), and is required for H3K9me at these loci (Fig. 1d and
Supplementary Fig. 3a). In rrp6Δ cells, loss of Ago1, Dcr1, Rdp1 or
Clr4 stabilized antisense transcripts from myp2 and SPCC1442.04c
(Fig. 1e), implicating RNAi and Clr4 in the regulation of these loci
in wild-type cells.

Retrotransposons are silenced by several mechanisms to ensure
genome stability. In S. pombe, centromere protein CENP-B homo-
logues and chromatin modifiers silence Tf2 retrotransposons and their
remnants. RNAi mutants show mild Tf2 derepression and siRNAs
mapping to these loci have not been detected. We observed siRNA
clusters and H3K9me in rrp6Δ at Tf2 (Fig. 2a, b and Supplementary
Fig. 3b). Small RNAs that map to the antisense strand show a pre-
ference for uridine at the 5' end and a length profile of 20-24 nucleo-
tides, whereas a portion of those mapping to the sense strand are
probably degradation products (Fig. 2c). H3K9me is restricted to the
Tf2 open reading frame except for Tf2 retrotransposons located in
close proximity where H3K9me could be detected at intervening
regions (Supplementary Fig. 4a). siRNA generation and H3K9me
at Tf2 require Ago1 and Dcr1 (Fig. 2a, b, d and Supplementary
Fig. 4a). RNAi is also required for H3K9me and siRNA clusters at
other repetitive elements, including a membrane transporter gene
duplicated in tandem and flanked by long terminal repeats (LTRs;
Supplementary Fig. 4b).
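
The two read-level features used above to distinguish siRNAs from degradation products, a 20-24 nucleotide length profile and a preference for uridine at the 5' end, are simple to tabulate. The Python sketch below assumes reads are supplied as plain sequence strings, with T in sequencing output treated as U. The function names are illustrative and do not come from the study's analysis pipeline.

```python
from collections import Counter


def length_profile(reads):
    """Fraction of reads at each length, keyed by length in nucleotides."""
    counts = Counter(len(read) for read in reads if read)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {length: n / total for length, n in sorted(counts.items())}


def five_prime_base_frequency(reads):
    """Fraction of reads starting with each base (T is reported as U)."""
    counts = Counter(read[0].upper().replace("T", "U") for read in reads if read)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {base: n / total for base, n in counts.items()}


def fraction_in_range(reads, shortest=20, longest=24):
    """Share of reads whose length falls within the siRNA-like size range."""
    reads = [read for read in reads if read]
    if not reads:
        return 0.0
    return sum(shortest <= len(read) <= longest for read in reads) / len(reads)
```

A narrow length peak with a strong 5' U bias would be expected for the antisense-strand reads described above, whereas a broad length distribution would be more consistent with degradation products.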

We next asked whether Clr4 is required for generation of siRNAs in
a manner similar to its role at centromeres. Small RNA profiling of
rrp6Δ clr4Δ revealed that Clr4 is essential for siRNA production at Tf2
(Fig. 2e, f), genes (Supplementary Fig. 5 and Supplementary Table 1)
and regions with little or no H3K9me (Supplementary Fig. 6a, b). Loss
of Rdp1, which requires Clr4 for chromatin association, also abolished
siRNA clusters (Fig. 2e, f and Supplementary Fig. 5). Thus, hetero-
chromatin and RNAi factors cooperate to generate siRNA clusters.

We also explored assembly of HOODs in wild-type S. pombe cells 
cultured under varied growth conditions experienced in the natural 
environment. HOODs were observed in cells grown in carbon-limiting 
conditions or at low temperature (Supplementary Fig. 7). In low glu- 
cose, cells showed elevated levels of H3K9me at several HOODs 
including Tf2, SPBC2D10.04 and the meiotic mcp5 gene (Supplemen-
tary Figs 7a and 8). Importantly, generation of siRNAs and HOOD for-
mation at these loci were impaired in ago1Δ (Supplementary Fig. 8).
Thus, assembly of HOODs is regulated by growth conditions in wild- 
type cells. 

We investigated the functional significance of RNAi and hetero-
chromatin factors in silencing target loci. Cells lacking Clr4 or RNAi
in combination with rrp6Δ showed severe defects in silencing of Tf2
and sexual differentiation genes (for example, mug5, mcp3, mek1 and
SPCC1442.04c), in contrast to mild or no defects in single mutants
(Fig. 3 and Supplementary Fig. 6c). Transcript levels were higher when
rrp6Δ was combined with RNAi mutants than when combined with
clr4Δ (Fig. 3a). These data clearly show that both RNAi and the
exosome are important to fully silence these target loci.

Targeting of centromeric repeats by RNAi confers epigenetic 
repression on surrounding loci. We found that both RNAi and
the exosome coordinate the silencing of multiple loci within HOODs 
(Fig. 3b-d). Importantly, deletion of man1 caused loss of H3K9me and 
defective silencing of surrounding genes (Fig. 3c, d). This finding 
reveals that nucleation of heterochromatin at genes via RNAi can 
modulate expression of other genes in cis. 

Several siRNA clusters map to known targets of the RNA surveil-
lance factor Red1, which silences sexual differentiation genes, genes
encoding transmembrane proteins and Tf2 (ref. 24). Red1 interacts
with the poly(A) polymerase Pla1 and degrades transcripts via a
mechanism involving the poly(A)-binding protein Pab2 and the exo-
some. Combining rrp6Δ with red1Δ, pab2Δ or pla1-37 caused a
reduction in siRNAs and H3K9me at Tf2 and most genes (Fig. 4a, b
and Supplementary Figs 1, 9-11 and Supplementary Table 2). By
contrast, loss of the RNA-binding protein Mmi1 that directs the exosome
and RNAi to certain meiotic loci had no effect on siRNAs and H3K9me
at most HOODs, except for a few loci including man1 (Supplementary
Figs 5 and 10 and Supplementary Table 2). Interestingly, defects in
Pla1 or Pab2 caused a reduction of H3K9me and siRNA clusters at
SPCC1442.04c and ste6, whereas red1Δ had little effect (Fig. 4c).
Therefore, Pla1 and Pab2 may represent core machinery that coope-
rates with other factors, as dictated by genomic contexts, to direct
H3K9me by RNAi.

We asked whether Pla1 channels transcripts into the exosome and
RNAi pathways. Our analysis revealed increased stabilization of Tf2
and gene transcripts in pla1-37 rrp6Δ, similar to ago1Δ rrp6Δ or dcr1Δ
rrp6Δ (Fig. 4d). Together with previous findings, these results indi-
cate that Pla1 and associated factors such as Red1 process RNAs that
feed into the exosome and RNAi machinery. Indeed, loss of both Rrp6
and Ago1 was required to abolish H3K9me at mcp5, whereas red1Δ
single deletion eliminated heterochromatin modification at this locus
(Supplementary Fig. 12).

Figure 3 | RNAi and heterochromatin factors act in an overlapping manner
with the exosome to silence genes located in HOODs. a, The role of Clr4 and
RNAi in silencing myp2, SPCC1442.04c and Tf2 was analysed by northern blot
analysis with a strand-specific probe to detect sense transcript. rRNA,
ribosomal RNA. b, The H3K9me distribution across HOOD-15 as determined
by ChIP-chip analysis of wild-type and rrp6Δ cells is shown. c, The effect of
deleting the HOOD-15 man1 gene, which contains a major siRNA cluster, on
H3K9me at the nearby mek1 and SPAC14c4.06c loci was determined by ChIP.
d, The effect of man1Δ on silencing of other genes in HOOD-15 was
determined by strand-specific RT-PCR.


Because many of the RNAi targets are developmentally regulated, 
we asked whether assembly of HOODs might be regulated in a similar 
manner. We induced sexual differentiation in nitrogen-starved M 
(mat1-M) cells through ectopic expression of mat1-Pc. Whereas nitro- 
gen starvation alone had no effect, mat1-Pc expression caused the loss 
of siRNAs and H3K9me at myp2 and SPCC1442.04c (Fig. 4e). siRNAs 
that map to Tf2 were still detected (Fig. 4f), indicating that devel- 
opmental signals selectively target HOODs that contain genes acti- 
vated during sexual differentiation. 

Finally, we explored the connection between RNAi and the exosome
in higher eukaryotes. Small-RNA profiling of Drosophila rrp6 mutant larvae
revealed an increase in reads that map to repeat elements, as compared
with control larvae (Supplementary Fig. 13a, b). These small
RNAs are predominantly 25-28 nucleotides and frequently begin with
uracil, similar to Piwi-interacting RNAs (piRNAs; Supplementary Fig.
13c, d). The sense and antisense transcripts of retroelements are
upregulated in rrp6 mutant larvae (Supplementary Fig. 13e). An increase
in small RNAs that correspond to piRNA clusters (such as 42AB) was
also observed (Supplementary Fig. 14), indicating that RNAi processes
repeat transcripts in the absence of the exosome.

Our analyses indicate that RNAi collaborates with heterochromatin 
factors to silence developmentally regulated loci and retrotransposons, 
which are also targeted by the exosome. Although not all loci displaying 
siRNA clusters in rrp6Δ might be bona fide targets, RNAi regulates
several of these loci in response to physiological conditions in wild-
type cells. The exosome may modulate the level of transcripts and/or
siRNAs that trigger RNAi in response to developmental and environ-
mental signals. In this regard, RNAi serves as an adaptive response
mechanism that reprograms the genome under specific growth con-
ditions. RNAi is triggered at developmentally regulated loci by special-
ized machinery that includes Red1 and Pla1 (Fig. 4g). These factors
redistribute during sexual differentiation, and may be targets of a
signalling cascade that regulates siRNA production. Other poly(A)
polymerases, including Mlo3-TRAMP (Trf4-Air2-Mtr4 polyadenyla-
tion; note that Trf4 and Air2 are known as Cid14 and Air1, respectively,
in S. pombe) that also activates both the exosome and RNAi, may share
targets with Pla1 and Red1. Indeed, Mlo3 and TRAMP are required for
silencing Tf2 and duplicated genes. Future studies may reveal inter-
action between RNA processing activities required for gene silencing
during development and for genome stability in S. pombe and higher
eukaryotes.

Figure 4 | Specialized machinery triggers RNAi to regulate developmental
genes and silence retroelements. a-c, Small RNA reads in the indicated strains
are plotted along the genomic region of HOOD-10 (a), HOOD-3 (b) and
HOOD-31 (c). Because pla1 is essential, a partial loss-of-function pla1-37 allele
was used. H3K9me distribution was determined by ChIP-chip. d, Northern
blot analysis of sense transcripts produced from myp2, SPCC1442.04c and Tf2
loci in the indicated strains. e, Induction of sexual differentiation by ectopic
expression of mat1-Pc (Pc) in M (mat1-M) cells under nitrogen starvation
conditions was used to assay siRNA and H3K9me at the myp2 and
SPCC1442.04c loci. f, Detection of siRNAs mapping to Tf2 under conditions
that induce sexual differentiation. g, Schematic representation of the current
model. Transcripts from sexual differentiation genes, genes containing
transmembrane domains and retroelements are recognized by a mechanism
involving Pla1, Red1 and Pab2, which promotes degradation of RNAs by RNAi
and/or the exosome.


METHODS SUMMARY 


ChIPs were performed using an anti-H3K9me antibody (Abcam) as described
previously. The preparation of small RNA libraries is described in the Methods.
For detection of small RNAs by northern blot, 10 µg of small RNAs, purified using
a mirVana miRNA isolation kit (Ambion), were resolved on a 15% denaturing gel,
crosslinked to neutral nylon membranes (Amersham/Pharmacia) and detected
using a ~50-nucleotide ³²P-labelled single-stranded RNA probe.


Full Methods and any associated references are available in the online version of
the paper.

METHODS

Strains and media. Standard protocols were used for cell culture, sporulation and
tetrad analysis. Deletion of endogenous man1 was performed by PCR-based gene
targeting with clonNAT resistance genes as selection markers. Because the pla1 gene
is essential for growth, a partial loss-of-function mutant allele of pla1 (pla1-37) (a
gift from M. Yamamoto’s laboratory) was used. Wild-type and mutant cells were
grown at 30 °C or 32 °C in either rich (YEA) or minimal (EMM) media. For
nitrogen starvation, cells were cultured in EMM medium lacking nitrogen source.
Strains and oligonucleotides used in this study are listed in Supplementary Tables
3 and 4, respectively.

Small RNA library construction. Small RNA fractions <40 nucleotides were 
prepared by gel purification from denaturing acrylamide gels, or by using flashPAGE 
fractionation (Life Technologies) according to the manufacturer’s recommenda- 
tion**. Small RNAs were treated with calf intestinal alkaline phosphatase (CIP) for 
1 hat 37 °Cata ratio of 1 unit of CIP (New England Biolabs (NEB)) per pmol of RNA 
to remove mono-, di- and triphosphates. After phenol/chloroform/isoamyl alcohol 
extraction and ethanol precipitation, 3’-adapters (5'-AppTCGTATGCCGTCTT 
CTGCTTGT-NH,-3’, App denotes phosphorylated adenosine) were ligated over- 
night at 16°C under conditions that are insensitive to 2'-O-methylation of the 
terminal nucleotide (50mM Tris pH7.5, 10mM MgCh, 1mM DTT, 12.5% 
PEG.) Typical ligations contained ~1-3pmol (~10-30ng) of small RNA. 
Adapters were in molar excess of RNA, and reactions contained 200 units of T4 
Rnl2 truncated enzyme and 20 units of murine RNase inhibitor in a total volume 
of 20 pl. Ten microlitres of the 3’-ligated RNA was phosphorylated by adding 3 il 
of water, 0.5 pl of T4 Rnll buffer and 0.5 ul (20 U) of murine RNase inhibitor, and 
1 pl (10 units) of T4 PNK (NEB) followed by incubation at 37 °C for 30 min. Final 
buffer composition for this reaction was 50 mM Tris pH 7.5, 10 mM MgCl, 1 mM 
DTT, 8.25% PEG, 0.33mM ATP in a 15ul reaction volume. The kinase was 
inactivated and the reverse transcription primer was annealed by adding a twofold 
molar excess (compared to 3’ -linker) of reverse transcription primer and water ina 
volume of 5 pl. The 20 ul mixture was heated to 75°C for 10 min, incubated at 
37°C for 10min, and then cooled to 25°C. 22.5 pmol of 5’-adapters (5’-rGr 
UrUrCrArGrArGrUrUrCrUrArCrArGrUrCrCrGrArCrGrArUrC-3’), 20 units of 
murine RNase inhibitor and 20 units of T4 RNA ligase 1 were added directly to the 
3'-ligations and annealed to small RNA ina total reaction volume of 24 il. Ligation 
reactions were incubated at 25 °C for 1h. Half (12 pl) of the doubly ligated small 
RNA was reverse transcribed using avian myeloblastosis virus reverse trans- 
criptase following the manufacturer’s instructions (NEB) for 30 min at 42°C. 
One-third of the cDNA was PCRamplified for 6-12 cycles using the primers 
5'-CAAGCAGAAGACGGCATACG-s-A-3', and 5'-AATGATACGGCGACC 
ACCGACAGGTTCAGAGTTCTACAGTCCG-s-A-3’ (where -s- denotes phos- 
phorothioate linkage) using 2 LongAmp Taq Mastermix (NEB). PCR reactions 
were separated by native 6% PAGE. Amplified DNA was visualized by ultraviolet 
transillumination using SybrGold nucleic acid stain (Life Technologies), and 
regions of the gel corresponding to RNA insert sizes of 18-40 nucleotides were 
excised, crushed and eluted from the gel in TE pH 8 for >2 h at 25 °C. Libraries
were concentrated using QiaQuick columns (Qiagen) and analysed using 
DNA1000 chips and reagents on an Agilent 2100 BioAnalyzer (Agilent). 
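
The ligation input quoted above (~1-3 pmol, ~10-30 ng of small RNA) can be cross-checked with a simple mass conversion. The sketch below is illustrative only: it assumes the common approximation of about 320.5 g/mol per ribonucleotide plus a 159 g/mol end correction for single-stranded RNA, and the function name `rna_pmol_to_ng` is hypothetical rather than part of the authors' workflow.

```python
# Approximate molar mass terms for single-stranded RNA (assumed values,
# not taken from the source): average nucleotide mass and end correction.
AVG_NT_MASS = 320.5      # g/mol per nucleotide
END_CORRECTION = 159.0   # g/mol


def rna_pmol_to_ng(pmol: float, length_nt: int) -> float:
    """Convert an amount of ssRNA in pmol to an approximate mass in ng."""
    if length_nt <= 0:
        raise ValueError("length_nt must be positive")
    molar_mass = length_nt * AVG_NT_MASS + END_CORRECTION  # g/mol
    picograms = pmol * molar_mass  # pmol x g/mol gives pg
    return picograms / 1000.0      # pg to ng


if __name__ == "__main__":
    # Small RNAs in the 18-40 nt window excised from the gel
    for length in (18, 30, 40):
        print(length, "nt:", round(rna_pmol_to_ng(1.0, length), 2), "ng per pmol")
```

Under these assumptions, 1 pmol of a 30-nucleotide RNA weighs a little under 10 ng, which is in line with the ratio of pmol to ng quoted in the text.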

ChIP analysis. ChIP and ChIP-chip analyses were performed as described previ- 
ously with antibodies against dimethylated H3K9 (Abcam)’. In brief, exponen- 
tially growing cells (5 X 10°) were fixed in 3% paraformaldehyde and chromatin. 
For ChIP of Chpl, cells were subsequently crosslinked by treatment with 
10 mM dimethyl adipimidate. Cells were washed with PBS, resuspended in lysis 
buffer (50 mM HEPES/KOH, pH 7.5, 140mM NaCl, 1mM EDTA, 1% Triton 
X-100, 0.1% deoxycholate) and homogenized with glass beads. Chromatin was 
sheared by sonication to fragments of 500-1,000 base pairs, precleared with 
protein A slurry and immunoprecipitated with 2-4 µg of antibody. Immunoprecipitated
chromatin was recovered by incubation with protein A or protein
G slurry, washed extensively and reverse crosslinked by incubation at 65 °C. 
Immunoprecipitated DNA and DNA isolated from WCE were analysed by multi-
plex PCR or subjected to microarray-based ChIP-chip analysis. 

Northern blot analysis. Northern blot analysis was performed as described previously’.
Total RNA was isolated from cells with the MasterPure yeast RNA purification kit
(Epicentre). 10 µg total RNA was loaded into each lane. Total RNA was
transferred from gels to a positively charged nylon membrane (BrightStar-Plus, 
Ambion) using a specific transfer buffer (NorthernMax transfer buffer, Ambion). 
The 32P-labelled single-stranded RNA probes were hybridized overnight to the
membrane in ULTRAhyb buffer (Ambion). 

Detection of small RNAs by northern blot. For small RNA detection, short 
RNAs (<200 nucleotides) were purified with the mirVana miRNA isolation kit 
(Ambion). 10 µg of small RNA was resolved on a 15% denaturing acrylamide gel,
followed by transfer to a neutral nylon membrane (Amersham/Pharmacia). 
Chemical cross-linking was performed as previously described**. Cross-linked
membrane was probed with 32P-labelled single-stranded RNA probes
(~50 nucleotides) corresponding to Tf2, myp2 or SPCC1442.04c. 

RT-PCR. Strand-specific RT-PCR was carried out as previously described*’. 
DNase-treated total RNA (100 ng), isolated with MasterPure yeast RNA puri- 
fication kit (Epicentre), was reverse-transcribed using the One-step RT-PCR kit 
(Qiagen). 

Drosophila melanogaster genetics. The rrp6 alleles used were {007001, which is a 
piggybac insertion into the rrp6 coding region and was a gift from E. Lei, and 
Df(3R)Exel6172, which is a chromosomal deletion of the rrp6 region and was 
obtained from the Bloomington Stock Center. Homozygous rrp6 mutants were 
generated by crossing heterozygous rrp6 flies that contained a balancer chromosome
with an actin-driven green fluorescent protein (GFP) marker gene.
Homozygous larvae were identified by the lack of GFP fluorescence throughout 
the body. Heterozygous siblings were used as controls. Because rrp6−/− flies failed
to grow beyond the L1 larval stage, we profiled small RNAs from the L1 larvae of
rrp6−/− and rrp6+/− flies.

Small RNA-seq analysis. Small RNA was sequenced using the Illumina sequencing
platform. Analysis of deep-sequencing data was performed using the commercial 
program Novoalign (http://www.novocraft.com). Novoalign was used to strip the 
adaptor sequences from the reads, and align them with the reference genome 
assembly of either S. pombe (September 2007 release) or D. melanogaster (NCBI 
release 5_30, 30 May 2011). The output parameters were set to report all valid 
alignments for each read, and only mapped reads were used in further analysis. 
Python and R scripts were used to further analyse the aligned read data. Python 
scripts were used to compile information about genome mapping coverage that 
included the length and the genomic coordinates of each read, and to generate a 
count of the number of times each position in the genome was represented in the 
data set. The coverage count (the number of times a read in the data set was 
mapped to a given location in the genome) was normalized to per million mapped 
reads. For the analysis of repetitive regions in D. melanogaster, the mapped reads 
were processed through the open source program, RepeatMasker*’. RepeatMasker 
revealed the identity and abundance of repeats in the small RNA data set, which 
were plotted using R. 
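
The coverage normalization described above can be sketched as follows. This is a minimal illustration, not the authors' script: the function name `coverage_per_million`, the `(chromosome, start, length)` tuple layout and the 0-based coordinates are assumptions made for the example.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Alignment = Tuple[str, int, int]  # (chromosome, 0-based start, read length)


def coverage_per_million(
    alignments: Iterable[Alignment], n_mapped_reads: int
) -> Dict[Tuple[str, int], float]:
    """Count how often each genomic position is covered by an alignment,
    then scale the counts to coverage per million mapped reads."""
    if n_mapped_reads <= 0:
        raise ValueError("n_mapped_reads must be positive")
    counts: Counter = Counter()
    for chrom, start, length in alignments:
        # Every position spanned by the alignment is counted once
        for pos in range(start, start + length):
            counts[(chrom, pos)] += 1
    scale = 1_000_000 / n_mapped_reads
    return {key: count * scale for key, count in counts.items()}


if __name__ == "__main__":
    toy = [("chrI", 0, 3), ("chrI", 2, 2)]
    for key, value in sorted(coverage_per_million(toy, n_mapped_reads=2).items()):
        print(key, value)
```

Because all valid alignments were reported for each read, a read that maps to several locations contributes to each of them here. Whether the original scripts weighted such reads differently is not stated in the text.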
TET2 promotes histone O-GlcNAcylation during gene transcription


Qiang Chen, Yibin Chen, Chunjing Bian, Ryoji Fujiki & Xiaochun Yu


Ten eleven translocation (TET) enzymes, including TET1, TET2 and 
TET3, convert 5-methylcytosine to 5-hydroxymethylcytosine' and 
regulate gene transcription” °. However, the molecular mechanism 
by which TET family enzymes regulate gene transcription remains 
elusive*®. Using protein affinity purification, here we search for 
functional partners of TET proteins, and find that TET2 and 
TET3 associate with O-linked β-N-acetylglucosamine (O-GlcNAc)
transferase (OGT), an enzyme that by itself catalyses the addition of 
O-GlcNAc onto serine and threonine residues (O-GlcNAcylation) 
in vivo’*. TET2 directly interacts with OGT, which is important for 
the chromatin association of OGT in vivo. Although this specific 
interaction does not regulate the enzymatic activity of TET2, it 
facilitates OGT-dependent histone O-GlcNAcylation. Moreover, 
OGT associates with TET2 at transcription start sites. Down- 
regulation of TET2 reduces the amount of histone 2B Ser 112 
GlcNAc marks in vivo, which are associated with gene transcription 
regulation. Taken together, these results reveal a TET2-dependent 
O-GlcNAcylation of chromatin. The double epigenetic modifica- 
tions on both DNA and histones by TET2 and OGT coordinate 
together for the regulation of gene transcription. 

Using protein affinity purification, we sought functional partners 
to the TET family enzymes. Mass spectrometry analysis showed that 
both TET2 and TET3 associate with OGT, an enzyme that catalyses
O-GlcNAcylation’® (Fig. 1a, Supplementary Table 1 and Supplementary
Fig. 1a, b). To confirm this interaction, we performed a
co-immunoprecipitation assay with overexpressed TET1, TET2, TET3
and OGT. TET2 and TET3, but not TET1, associated with OGT 
(Fig. 1b and Supplementary Fig. 2a). To study the endogenous inter- 
action, we used mouse embryonic stem (ES) cells in which TET2, but 
not TET3, was expressed’*’. Again, TET2 associated with OGT endo- 
genously in ES cells (Fig. 1c and Supplementary Fig. 2d). To map the 
regions of interaction between TET2 and OGT, we generated a series 
of TET2-deletion mutants, and found that the carboxy-terminal catalytic
double-stranded β-helix (DSBH) domain of TET2 interacts with
OGT (Fig. 1d and Supplementary Fig. 2b). The tertiary structure of 
OGT has been examined'®"’. There are two major functional domains 
in OGT. One is the amino-terminal regulatory domain that is com- 
posed of 13.5 tetratricopeptide repeats (TPR); the other is the 
C-terminal catalytic domain that transfers GlcNAc from UDP- 
GlcNAc to Ser/Thr residues of the substrate. Using the internal- 
deletion mutants, we found that TPR5 and TPR6 of OGT are required 
for the interaction with TET2 (Fig. 1e and Supplementary Fig. 2c, e, f).
We also generated baculoviruses encoding either TET2 catalytic 
domain (TET2CD) or OGT. After co-infecting Sf9 cells with these 
two types of baculovirus, the recombinant proteins of streptavidin- 
binding peptide (SBP)-tagged TET2CD and glutathione S-transferase 
(GST)-tagged OGT could be co-purified at 1:1 stoichiometry from cell 
lysates, suggesting TET2 tightly binds OGT (Fig. 1f). Moreover, the 
internal-deletion mutants D4 (which lacks TPR9 and TPR10) and D5 
(which lacks TPR11 and TPR12) of OGT slightly reduced the inter- 
action with TET2, indicating that other TPR repeats may also regulate 


the interaction between OGT and TET2 (Fig. 1e and Supplementary
Fig. 2f). In addition, recombinant TET3 catalytic domain also directly 
binds OGT (Supplementary Fig. 2g). Taken together, these results 
demonstrate that TET2 and TET3 form a complex with OGT both 
in vitro and in vivo. 

TET enzymes were originally identified as homologues of JBP1 and 
JBP2 in Trypanosoma brucei; these two thymidine hydroxylases are 
involved in the synthesis of the modified DNA base β-D-glucosyl-hydroxymethyluracil
(also known as base J)'. Synthesis of base J consists
of two steps in which it is directly converted from thymidine bases
incorporated in DNA strands. In the first step, JBP1 and JBP2 catalyse 
the oxidation of thymidine to hydroxymethyluracil. During the second 
step, a yet unknown enzyme transfers a glucose residue onto the 
hydroxyl group of hydroxymethyluracil to form base J'’. We wondered 
whether OGT could catalyse the glycosylation on hydroxyl groups 
of 5-hydroxymethylcytosine (5hmC). To test this hypothesis,
5-methylcytosine (5mC) was incubated with recombinant TET2 and
OGT in vitro. However, OGT failed to convert 5hmC into any new
base in vitro (Supplementary Fig. 3a, b). Co-expression of OGT with 
TET2 in cells did not alter the TET2-dependent 5hmC synthesis 
(Supplementary Figs 3c and 4). Furthermore, we could not detect 
glycosylated 5hmC in vivo using mass spectrometry (Supplementary
Figs 3d, e and 5). These results suggested that OGT does not affect 
TET2-dependent 5hmC synthesis. 

Next, we asked whether TET2 regulates the function of OGT. We 
fractionated ES cell lysates using different salt concentrations and pH 
levels. A subset of TET2 and OGT could only be eluted from the 
chromatin using 300 mM NaCl or 0.2 M HCl, indicating that these
TET2 and OGT species are tightly associated with the chromatin 
(Supplementary Fig. 6a). Interestingly, knockdown of TET2 by short 
hairpin RNA (shRNA) in ES cells abolished the chromatin-associated 
OGT, suggesting that TET2 may target OGT to chromatin (Sup- 
plementary Fig. 6a). To verify this phenomenon, we used 293T cells 
stably expressing TET2. Because the exogenous TET2 level was 
much higher than the endogenous level (Supplementary Fig. 7), the 
chromatin-bound OGT was markedly increased in 293T cells stably 
expressing TET2 (Supplementary Fig. 6b). Moreover, because the D2 
mutant of OGT abolished the interaction with TET2, only wild-type 
OGT but not the D2 mutant existed in the chromatin fraction, sug- 
gesting that the interaction with TET2 is important for the chromatin 
localization of OGT (Supplementary Fig. 6c). In addition, knockdown 
of OGT by shRNA did not significantly affect the chromatin retention 
of TET2 (Supplementary Fig. 6d). Collectively, these results suggest 
that TET2 recruits OGT to chromatin. 

Recently, it has been shown that histones can be modified by OGT at 
different sites'*'”. In particular, OGT regulates O-GlcNAcylation of 
histone 2B (H2B) at Ser 112 in vivo’. Because TET2 targets OGT to 
chromatin, we wondered whether TET2 regulates OGT-dependent 
H2B O-GlcNAcylation. With a specific antibody for H2B Ser 112 
GlcNAc, we found that H2B Ser 112 GlcNAc was significantly reduced 
in TET2-depleted ES cells (Fig. 2a). Moreover, TET2 also regulates 
Figure 1 | TET2 forms a complex with OGT. a, Purification of TET2- and
TET3-associated proteins. TET2- and TET3-associated proteins were analysed 
by mass spectrometry and are shown in the Supplementary Fig. 1a. b, OGT 
interacts with TET2 and TET3 but not with TET1. SBP-tagged TET1-3 
proteins were expressed and examined with indicated antibodies. Whole cell 
lysate (WCL) was used as the input. c, OGT interacts with TET2 endogenously 
in ES cells. Irrelevant IgG was used as the immunoprecipitation control (ctrl); 
PALB2 was used as the negative control. d, The DSBH domain of TET2 


O-GlcNAcylation of other histones (Supplementary Fig. 8). Con- 
current with TET2 overexpression inducing OGT association with 
chromatin, H2B Ser 112 O-GlcNAcylation was increased (Fig. 2b). 
Interestingly, the TET2 enzymatic-dead mutant (His1382Tyr/ 
Asp1384Ala) that still interacted with OGT also increased H2B 
Ser 112 O-GlcNAcylation, suggesting that the enzymatic activity of 
TET2 is not required for the regulation of H2B O-GlcNAcylation 
(Fig. 2b and Supplementary Fig. 9). Moreover, only wild-type OGT 
but not the enzymatic-dead mutant of OGT (Gly482Ser) nor the D2 
mutant lacking the interaction with TET2 could induce H2B Ser 112
O-GlcNAcylation in vivo (Fig. 2c). These data therefore suggest that 
the interaction between TET2 and OGT is important for OGT- 
dependent histone glycosylation in vivo. To examine the mechanism 
by which TET2 boosts the enzymatic activity of OGT, we performed 
an in vitro glycosylation assay using wild-type OGT and its D2 mutant. 
Using histone octamers as the substrate, both wild-type OGT and the 
D2 mutant only weakly glycosylated histones. The enzymatic activity 
of wild-type OGT was indistinguishable from that of the D2 mutant 
as both proteins can auto-glycosylate themselves (Supplementary 
Fig. 10). Moreover, supplementation of TET2 did not affect the enzy- 
matic activity of either wild-type OGT or the D2 mutant (Fig. 2d). 
However, when mono-nucleosomes were used as the substrates, sup- 
plementation of recombinant TET2 markedly increased the enzymatic 
activity of wild-type OGT but not the D2 mutant (Fig. 2d and 
Supplementary Fig. 11), suggesting that the interaction with TET2 
facilitates OGT recognition of the substrate. Although histones H3 
interacts with OGT. Deletion mutants of TET2 were expressed. The F3
mutant containing the core-DSBH domain (catalytic domain) interacts with 
OGT. e, TPR5 and TPR6 of OGT interact with TET2. The D2 mutant of OGT, 
which lacks TPR5 and TPR6, abolished the interaction with TET2. f, OGT 
directly binds TET2. Sf9 cells were infected with baculoviruses encoding SBP- 
OGT and/or GST-TET2CD. The protein complex was purified by streptavidin 
beads or GST beads and examined by Coomassie blue staining. 


and H4 were glycosylated in vivo, only GlcNAcylated H2A and H2B 
could be clearly detected in this in vitro glycosylation assay (Sup- 
plementary Fig. 11). One possibility is that glycosylated H2A and 
H2B suppress H3 and H4 glycosylation by OGT. Alternatively, the
glycosylation sites on H3 and H4 either in the histone octamer or as 
mono-nucleosomes are not well exposed to the enzyme. In contrast to 
many other enzymes, OGT only efficiently glycosylates the substrates 
that it associates with’**’. Thus, it is likely that TET2 recognizes the 
chromatin and recruits OGT to the chromatin, and the chromatin- 
associated OGT glycosylates nucleosomal histones at its vicinity. 
Consistently, only TET2 but not OGT recognizes double-stranded 
DNA or mono-nucleosome with 5mC (Supplementary Fig. 12). 

To examine the distribution of OGT and TET2 on the chromatin of 
ES cells, we performed genome-wide chromatin immunoprecipitation 
(ChIP) sequencing analysis (ChIP-seq) using anti-OGT, anti-H2B 
Ser 112 GlcNAc and anti-mouse TET2 antibodies. We validated our 
ChIP-seq results using ChIP-quantitative PCR (qPCR) to examine 45 
different loci that represent a broad range of ChIP-seq fragment counts 
(Supplementary Fig. 13). Next, we compared the TET2 target genes 
with a published hmeDIP database’, and found that 47% of 5hmC- 
positive genes are bound by TET2 (Supplementary Fig. 14a). Most of 
the TET2 target genes are associated with high- and intermediate- 
density CpG promoters (Supplementary Fig. 14b, c), which are also 
positive for the trimethylated form of histone H3 at lysine 4 
(H3K4me3)” (Supplementary Fig. 14d). Gene Ontology analysis 
showed that OGT, H2B Ser 112 GlcNAc and TET2 are involved in a
Figure 2 | TET2 enhances histone glycosylation. a, Downregulation of TET2
impairs H2B O-GlcNAcylation in ES cells. H2B O-GlcNAcylation was 
examined by immunoprecipitation with an anti-H2B antibody and western 
blot with an anti-GlcNAc antibody (RL2) or anti-H2B Ser 112 GlcNAc 
antibody. Histogram shows the relative level of H2B Ser 112 GlcNAc in TET2-downregulated
cells (shTET2) compared to that in control-shRNA-treated cells
(shCtrl). b, Upregulation of wild-type TET2 or TET2 enzymatic-dead mutant 
(His1382Tyr/Asp1384Ala) induced H2B Ser 112 O-GlcNAcylation in 293 cells. 


variety of basic cellular processes (Supplementary Fig. 15). The ana- 
lysis showed that OGT and H2B Ser 112 GlcNAc display a consider- 
able overlap of target genes with TET2 (Fig. 3a), and similar binding 
profiles to TET2 at transcriptional start sites (TSS) (Fig. 3b, c). Moreover,
the binding sites of OGT, H2B Ser 112 GlcNAc and TET2
have the highest density around TSS (Fig. 3c). Knockdown of TET2 
markedly suppressed the recruitment of OGT to tested target genes 
and reduced the level of H2B Ser 112 GlcNAc at those loci (Fig. 3d). By 
contrast, knockdown of OGT had little effect on TET2 at these loci, 
suggesting that OGT does not contribute to the recruitment of TET2 to 
these loci (Fig. 3d). Next, we asked whether there was a correlation 
between TET2-mediated OGT targeting and the targeted gene tran- 
scription levels. In wild-type ES cells, we found that genes that were 
occupied by OGT and TET2 and enriched with H2B Ser 112 GlcNAc 
were associated with high levels of transcription in ES cells (Fig. 3e and 
Supplementary Fig. 16). Moreover, we examined alterations to the 
gene transcription profile in TET2 knockdown ES cells. The percen- 
tage of downregulated genes that were occupied by OGT, H2B Ser 112 
GlcNAc and TET2 was significantly higher than that of genes not 
occupied by OGT, H2B Ser 112 GlcNAc and TET2 (Supplementary 
Fig. 17). To validate these results, we performed qPCR to confirm 
the downregulation of a number of OGT, H2B Ser 112 GlcNAc and 
TET2 target genes (Fig. 3f and Supplementary Fig. 18). In addition, 
we reconstituted TET2-depleted ES cells with the His1382Tyr/ 
Asp1384Ala mutant that abolishes the enzymatic activity of TET2. 
The His1382Tyr/Asp1384Ala mutant could not restore 5hmC at 


c, The interaction between OGT and TET2 is important for H2B 
O-GlcNAcylation and H2B Ser 112 O-GlcNAcylation. Wild-type OGT, the
enzymatic-dead mutant of OGT (Gly482Ser) and the D2 mutant were 
expressed in 293T cells. H2B O-GlcNAcylation and H2B Ser 112 
O-GlcNAcylation were examined. d, TET2 facilitated OGT-dependent histone 
glycosylation in mono-nucleosomes but not in recombinant core histones. 
Tritium-labelled GlcNAc was incorporated into the histones in the in vitro 
O-GlcNAcylation assay. All error bars denote s.d., n = 3. 


the TSS of a set of the TET2 and OGT common target genes. How- 
ever, the His1382Tyr/Asp1384Ala mutant restored H2B Ser 112 
GlcNAc and partially rescued the TET2 and OGT target gene ex- 
pression (Supplementary Fig. 19). These results indicate that OGT- 
dependent histone O-GlcNAcylation contributes to TET2-dependent 
gene transcription. 

Taken together, here we have shown that TET2 and OGT form a 
complex, which might regulate gene transcription (Supplementary 
Fig. 20). Besides TET2, TET3 also interacts with OGT. Like TET2, 
TET3 might also target OGT to chromatin for gene transcription regu- 
lation. Owing to the different tissue distribution®”**, it is TET2 that 
targets OGT to the chromatin and regulates H2B O-GlcNAcylation in 
ES cells. Recently, it has been reported that H2B O-GlcNAcylation is 
associated with active TSS and positively regulates transcription’. 
However, 5hmC-enriched regions are associated with both transcriptional
activation and repression” *”®. Because TET1 is associated with
the transcriptional repression complex Sin3A, it is likely that TET1-dependent
5hmC is associated with transcriptional repression’’. By
contrast, TET2 exists in a totally different complex with OGT but 
not Sin3A. Thus, the TET2-OGT complex might be involved in tran- 
scriptional activation in ES cells. Further analysis on each individual 
5hmC-enriched locus could distinguish the identity of 5hmC in tran- 
scription regulation. Moreover, OGT also affects other histone modi- 
fications during mitosis”*. It is possible that chromatin remodelling 
regulated by histone GlcNAcylation could be a general phenomenon
in other biological processes. In addition to the transcription activation, 
Figure 3 | TET2 regulates H2B Ser 112 GlcNAc and gene transcription.
a, Venn diagram shows a considerable overlap between OGT, H2B Ser 112
GlcNAc and TET2 target genes. b, Mean distribution of tags at gene TSS
(±4 kilobases (kb)). bp, base pairs. c, Examples of OGT, H2B Ser 112 GlcNAc
and TET2 ChIP-seq results in mouse ES cells. d, ChIP-qPCR was performed to 
examine TET2, OGT and H2B Ser 112 GlcNAc in control, TET2-knockdown or 
OGT-knockdown cells (using shCtrl, shTET2 or shOGT, respectively). e, The 
expression of TET2 and OGT common target genes is higher than average gene 
expression in ES cells. Data analysis is explained in the Methods. Box plots show 
median, twenty-fifth and seventy-fifth percentile expression levels in ES cells. 
P<0.00001. f, Loss of TET2 or OGT is associated with the reduced expression 
of their common target genes. All error bars denote s.d., n = 3. 


OGT is also involved in transcription repression’””?*°. Thus, TET2 is 
only one of the functional partners of OGT. OGT in other complexes 
may have an important role in gene silencing’”’””’. 


METHODS SUMMARY 


Details of cell cultures, plasmids, RNA interference and antibodies used, as well as 
descriptions of methods for purification of TET enzymes associated proteins, 
western blotting, immunoprecipitation, mass spectrometry analysis, in vitro 
5hmC assay, dot blotting assay, silver staining, in vitro O-GlcNAcylation assay, 
histone fraction, electrophoretic mobility shift assay, chromatin immunoprecipi- 
tation assay, DNA immunoprecipitation assay, ChIP sequencing, messenger RNA 
analysis and statistical analyses are provided in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Plasmids and antibodies. For protein affinity purification and other analyses, 
full-length complementary DNAs of human TET1, TET2 and TET3 were cloned 
into the pS-Flag-SBP (SBP) vector. For generating internal-deletion mutants, OGT 
cDNA fragments were PCR amplified and cloned into pCMV-Myc. For generat- 
ing truncation mutants, TET2 cDNA was PCR amplified and cloned into the SBP 
vector. TET2CD (amino acids 916-1921) was cloned into the SBP vector and 
pFast-Bac vector. Internal-deletion mutants of OGT, enzymatic-dead mutant 
(Gly482Ser) of OGT and enzymatic-dead mutant (His1382Tyr/Asp1384Ala) 
of TET2 were generated using the QuikChange site-directed mutagenesis kit 
(Stratagene). 

Rabbit anti-mouse TET2 antibody was raised against amino acids 1-200 of 
mouse TET2. Rabbit anti-mouse TET1 antibody was raised against the C terminus
(amino acids 1500-2008) of mouse TET1. Anti-OGT antibody was purchased
from Novus. Anti-Flag, anti-Myc and anti-β-actin antibodies were purchased
from Sigma. Anti-5hmC polyclonal antibody was purchased from Zymo Research. 
Antibodies against histone H2A, H2B, H3, H4, H3K4me3, H3K27me3 and H3 
acetylated at Lys 14 (H3K14Ac) were purchased from Millipore. RL2 monoclonal 
antibody against O-GlcNAc was purchased from Abcam. CTD110.6 monoclonal 
antibody against O-GlcNAc was purchased from Covance. Monoclonal antibody 
against H2B Ser 112 GlcNAc was generated as described previously’’. 
Purification of TET-associated proteins. To search for binding partners of TET2 
and TET3, we collected 1 l of 293T cells stably expressing SBP-TET2 and SBP-TET3,
respectively, and washed the cell pellets with PBS. Cells were lysed with 30 ml ice-cold
NETN300 buffer (50 mM Tris-HCl, pH 7.4, 2 mM EDTA and 300 mM NaCl).
The soluble fraction was incubated with 0.5 ml streptavidin-conjugated agarose 
beads. The beads were washed with NETN100 buffer (50 mM Tris-HCl, pH 7.4, 
2mM EDTA and 100 mM NaCl) three times. Associated proteins were eluted with 
2 mM biotin (Sigma) in PBS, and further incubated with 50 µl S-protein-agarose
beads (Novagen). The bound proteins were eluted with SDS sample buffer (20% 
glycerol, 120 mM Tris-HCl, pH 6.8, 4% SDS, 0.02% bromophenol blue and 2.5% 
β-mercaptoethanol), and analysed by 10% SDS-PAGE and mass spectrometry.
Cells expressing empty vector were used as purification controls. 

Silver staining. Purified materials were loaded onto 8% SDS-PAGE for electrophoresis.
The gel was fixed with 50% methanol, 10% acetic acid for 15 min,
followed by further fixation with 10% ethanol, 5% acetic acid for 6 min. For
staining, the gel was agitated in 50 ml 20 mg l−1 Na2S2O3 for 30 min, then stained
in 50 ml AgNO3 solution (100 mg AgNO3 in 0.1% formaldehyde) for 30 min. After
washing with ddH2O, the gel was treated with 50 ml image developer (3% Na2CO3,
50 µl 37% formaldehyde and 250 µl 0.2% sodium thiosulphate). The reaction was
stopped using stop solution (12.5 ml acetic acid and 25 g Tris in 500 ml ddH2O)
when clear bands were observed. 

Recombinant proteins. Recombinant proteins were purified from Sf9 insect cells. 
For generating baculovirus, DNA fragments containing full-length OGT, and 
deletion mutants of OGT, TET2CD and TET3CD were subcloned into the 
pFastBac vector with a GST or SBP tag. Baculoviruses were generated according 
to manufacturer’s instructions (Invitrogen). After Sf9 cells were infected with 
baculoviruses for 48 h, the cells were collected, washed with PBS and lysed with 
ice-cold NETN100 buffer. The soluble fraction was incubated with glutathione- 
sepharose beads (for GST-tag proteins) or streptavidin-conjugated beads (for 
SBP-tag proteins) and eluted with glutathione or biotin, respectively. To purify
protein complex, Sf9 cells were co-infected with baculoviruses encoding GST- 
TET2CD and SBP-OGT for 48 h. The cells were then lysed and the soluble fraction 
was incubated with glutathione-sepharose beads or streptavidin-conjugated beads 
as indicated. 

Cell lysis, immunoprecipitation and western blotting. For immunoprecipi- 
tation, cells were lysed with ice-cold NETN300 buffer containing 10 mM NaF 
and 50 mM β-glycerophosphate. Supernatants were incubated with indicated
antibodies and protein-G-conjugated sepharose beads (Amersham Pharmacia). 
Precipitates were washed five times with NETN100, subjected to SDS-PAGE and 
western blotting with indicated antibodies. 

To examine the localization of OGT (Supplementary Fig. 6), cell pellets were 
lysed with 400 µl NETN100 buffer. After centrifugation, the supernatants were
named as 100 mM NaCl samples. The insoluble pellets were collected, washed with
ice-cold PBS, and incubated with 400 µl NETN300 buffer on ice. After centrifugation,
the supernatants were named as 300 mM NaCl samples. The remaining
pellets were washed twice with ice-cold PBS and then treated with 200 µl 0.2 N
HCl. The supernatants were neutralized with 200 µl Tris-HCl, pH 8.8, and named
as 0.2 N HCl fractions. Twenty microlitres of each sample was loaded onto 7.5% 
SDS-PAGE gels and immunoblotted by indicated antibodies. The total amount of 
OGT was calculated by adding OGT from these three different fractions in each 
sample. The percentage of OGT in each fraction was presented. 
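
The fraction percentages described above amount to dividing each fraction's OGT signal by the summed signal of the three fractions. The sketch below illustrates that arithmetic only; the function name `fraction_percentages` and the example band intensities are hypothetical and are not data from the study.

```python
from typing import Dict


def fraction_percentages(signals: Dict[str, float]) -> Dict[str, float]:
    """Express each fraction's signal as a percentage of the summed signal."""
    total = sum(signals.values())
    if total <= 0:
        raise ValueError("total signal must be positive")
    return {name: 100.0 * value / total for name, value in signals.items()}


if __name__ == "__main__":
    # Hypothetical densitometry values (arbitrary units) for one sample
    ogt_bands = {"100 mM NaCl": 60.0, "300 mM NaCl": 30.0, "0.2 N HCl": 10.0}
    for fraction, pct in fraction_percentages(ogt_bands).items():
        print(f"{fraction}: {pct:.1f}% of total OGT")
```

Since equal volumes (20 µl) of each fraction were loaded, a calculation like this assumes the band intensities have already been adjusted for any difference in total fraction volume.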
Histone fraction. To examine histone glycosylation by immunoprecipitation and
western blot, we followed a protocol from ref. 16. In brief, cells were lysed in 
20 mM Tris-HCl, pH 8.0, 600 mM NaCl, 0.5% NP-40, 0.5% SDS, 0.5% deoxycholate,
1 mM EDTA and 2 µM of the OGA inhibitor PUGNAc (Sigma). The cells
were sonicated and clarified by centrifugation. The lysate was then diluted by 
NETN100 and histones were immunoprecipitated with the indicated antibodies. 

To examine histone glycosylation by western blot, cells were lysed by NETN300 
buffer with 2 µM PUGNAc. The pellets were washed twice with NETN300 and
treated with 0.2 N HCl. After centrifugation, the supernatants were neutralized by
the same volume of Tris-HCl, pH 8.8. Samples were examined by 10-20% gradient
SDS-PAGE or 15% Triton-acetic acid-urea (TAU) gel electrophoresis.

TAU gel electrophoresis. Fifteen per cent TAU gel was prepared as reported*’. In 
brief, the gel contained 15% acryl:bisacrylamide 60:0.4, 6 M urea, 5% acetic acid 
and 0.37% Triton X-100. Acid-extracted histones, histone octamers or mono-nucleosomes
were dissolved in 2× loading buffer (8 M urea, 10% glycerol and
2.5% acetic acid) and loaded onto the gel. The gel was run in 5% acetic acid at 200 V 
for 140 min. For Coomassie blue staining, the gel was stained with Coomassie blue 
250. For autoradiography, the gel was treated with EN3HANCE (PerkinElmer) 
before exposure to X-ray film. For western blot, the gel was washed twice in 300 ml 
of 50 mM acetic acid and 0.5% SDS for 30 min each time, and twice in 300 ml of 
transfer buffer (25 mM Tris, 192 mM glycine, 20% (v/v) methanol and 0.1% SDS) 
for 30 min each time. The electrotransfer reaction was carried out at a constant 
current of 400 mA for 2h at room temperature. 

RNA interference. shRNA sequences targeting human OGT (5’-GCACATAG 
CAATCTGGCTTCC-3’ (ref. 32) or 5'-CCAAACTTTCTGGATGCTTAT-3’), 
mouse Ogt (5'-GCACACAGCAATCTGGCCTCC-3’ or 5'-AGGGAACTAGAT 
AACATGCTT-3’ (ref. 33)) and mouse Tet2 (5’-CACTACTAACTCCACCC
TAAA-3’ (ref. 34) or 5’-GCTCTGAACAGTATTCAAAGC-3’ (ref. 6)) were
inserted into the pMSCV-puro-pSilencer vector. Generating retrovirus was car- 
ried out as previously described**. pMSCV-puro-shmock, pMSCV-puro-shOGT 
or pMSCV-puro-shTET2 vectors were used to transfect packaging cell line 293T 
with two other helper packaging plasmids pMD-MLV-OGP (gag-pol) and pVSV- 
G (env). Tissue culture supernatants from 293T packaging cells transfected with 
the above plasmids were collected. Viral particles were concentrated by centrifu- 
gation at 50,000g for 3 h and then added to the culture media of ES cells. The viral- 
infected cells were selected by puromycin for 72 h and then used in the following 
experiments. Similar results were obtained from shRNA treatment with different 
sequences. 

In vitro O-GlcNAcylation assay. Recombinant SBP-OGT protein (0.5 µg) was
incubated with 2 µg substrates (histone octamers, mono-nucleosomes or 210-bp
5hmC-containing oligonucleotides with sequence listed in Supplementary
Table 7) and 0.5 mM (0.5 µCi) UDP-[3H]GlcNAc (from ARC) in 25 µl reaction
buffer (50 µM Tris-HCl, pH 7.5, 12.5 mM MgCl2 and 1 mM dithiothreitol
(DTT)) for 1 h at 37 °C. The reaction was stopped by 10 mM EDTA. To detect
potential DNA glycosylation, DNA was pulled down by streptavidin beads. The 
beads were subjected to scintillation counter measurement. β-Glucosyltransferase
(Zymo Research) was used as a positive control to produce GlcNAcylated-5hmC
(5ghmC) following the protocol of Zymo Research, with UDP-[3H]GlcNAc as the
donor. For histone glycosylation analyses, the reaction was resolved with SDS- 
PAGE, and then subjected to autoradiography after incubation with EN3HANCE 
(from PerkinElmer). Nucleosome packaging was performed as described below 
with recombinant core histones purified from bacteria. 

Electrophoretic mobility shift assay. The 157-bp DNA oligonucleotide (Sup- 
plementary Table 7) containing 5mC was produced by PCR using dATP, dGTP, 
dmCTP and dTTP. The oligonucleotide was [³²P]-labelled at the 5' end by T4 
DNA kinase and purified using a G50 spin column (Pharmacia Biotech). The 
labelled oligonucleotide (0.1 pmol) was incubated with GST (0.4 µg), GST-OGT 
(0.4 µg) and/or GST-TET2CD (0.1, 0.2 or 0.4 µg), in reaction buffer (50 mM 
HEPES, pH 7.9, 10 mM MgCl2, 50 mM NaCl, 75 µM Fe(NH4)2 and 0.1 mg ml⁻¹ 
bovine serum albumin (BSA)) for 30 min at room temperature. The samples 
were loaded onto 6% native polyacrylamide gel and run at 150 V in 0.5× Tris- 
borate-EDTA (TBE) buffer. The gels were then fixed with 30% methanol and 10% 
acetic acid and exposed to X-ray film. 

For electrophoretic mobility shift assays using mono-nucleosomes, we gene- 
rated oligonucleotides containing cytosine or 5mC by PCR. PCR products were 
end-labelled with ³²P and purified by gel electrophoresis. To generate mono- 
nucleosomes, labelled DNA oligonucleotides were incubated with core histones 
at a 1:1 molar ratio in a buffer containing 2 M NaCl and 1 mM DTT to a total 
volume of 30 µl. Mixtures were then serially dialysed against 2 M, 0.85 M, 0.65 M 
and 0.2 M NaCl in 10mM Tris-HCl, pH 7.5, 1mM EDTA and 1mM DTT for 
4-10h at each step. Mono-nucleosomes were separated from free DNA by ultra- 
centrifugation in 5-30% linear glycerol gradients. After centrifugation at 150,000g 
in a SW50.1 rotor for 18 h at 4 °C, gradient fractions were collected. Fractions 
containing mono-nucleosomes were pooled, and dialysed against 10 mM Tris- 
HCl, pH 8.0, and 1 mM DTT. For nucleosome-binding reactions, GST-OGT 
(0.2 µg) and GST-TET2CD (0.05, 0.1 or 0.2 µg) were incubated with 20 nM 
mono-nucleosomes in 20 µl, with final buffer conditions of 50 mM HEPES, 
pH 7.9, 10 mM MgCl2, 75 µM Fe(NH4)2, 50 mM NaCl, 1 mM DTT, 1% glycerol 
and 1 mg ml⁻¹ BSA. The reactions were incubated at room temperature for 
1 h. Binding reactions were loaded onto 4% native polyacrylamide gels and run at 150 V 
in 0.5× TBE buffer. After electrophoresis, gels were dried and examined by 
autoradiography. 

ChIP assay and DNA immunoprecipitation assay. ChIP assays were performed 
according to the protocol described by Upstate. ES cell DNA was sonicated to an 
average size between 300 and 600 bp. Solubilized chromatin was immunoprecipi- 
tated with antibodies against TET2, OGT, H2B Ser 112 GlcNAc, H3K4me3 and 
H3K27me3. Antibody-chromatin complexes were pulled-down using protein 
A-sepharose, washed and then eluted. After cross-link reversal and proteinase 
K treatment, immunoprecipitated DNA was extracted with phenol-chloroform, 
ethanol precipitated and treated with RNase. ChIP DNA was quantified using 
PicoGreen. 

For hydroxymethylated DNA immunoprecipitation assay, genomic DNA was 
extracted using the Qiagen QIAamp DNA mini kit. The purified genomic DNA 
was sonicated using Diagenode Bioruptor 200. The DNA fragment size was deter- 
mined by 2% agarose gel, and should be 200-800 bp. The samples were heat- 
denatured and mixed with 1 µg 5hmC antibody. The mixtures were incubated 
overnight at 4 °C and then pulled down by protein A-sepharose beads. The beads 
were extensively washed and digested by proteinase K. DNA was further purified 
by phenol-chloroform extraction and ethanol precipitation. 5hmC immunopre- 
cipitated DNA was quantified using PicoGreen. 

ChIP sequencing. DNA fragments isolated from ChIP were repaired to blunt 
ends by T4 DNA polymerase and phosphorylated with T4 polynucleotide kinase 
using the END-IT kit (Epicentre). A single ‘A’ base was added to the 3’ end with 
Klenow. Double-stranded adaptors (75 bp with a ‘T’ overhang) were ligated to the 
fragments with DNA ligase. Ligation products between 200 and 600 bp were gel- 
purified to remove unligated adaptors and subjected to 20 PCR cycles. Completed 
libraries were quantified with PicoGreen. For ChIP-seq analyses, the DNA libra- 
ries were analysed by Solexa/Illumina high-throughput sequencing. The read 
quality of each sample was determined by FastQC software. After prefiltering 
the raw data by removing sequence adapters and low-quality reads, the tags were 
mapped to the mouse genome (assembly mm9) by Bowtie software. Parameter 
settings were as follows: -v 3 (report alignments with at most three mis- 
matches), -5 3 and -3 3 (trim 3 bases from the 5' and 3' ends to remove low-quality 
bases). Peak detection was performed using MACS software from Galaxy browser 
(http://www.galaxy.psu.edu). Parameters settings were as follows: IgG ChIP-seq 
aligned reads were used as control file, tag size with 25 bp, band width with 300 bp. 
Venn diagram analysis was performed with Galaxy browser. The peaks obtained 
from ChIP sequencing were matched to the annotated reference genome (mouse 
mm9) using Cisgenome 2.0. To view the peak density and position, Cisgenome 2.0 
was used. The relative accumulations of tags around the TSS were performed using 
SEQMINER software. The TSS position of each gene was downloaded from the 
UCSC genome browser. Gene Ontology analysis was performed using Panther. All 
the statistics and plottings were obtained using the statistical program R. To 
compare TET2 and 5hmC, hmeDIP database was downloaded from NCBI’. A 
set of 30 PCR primer pairs (Supplementary Table 5) was designed to amplify 100- 
140-bp fragments from genomic regions showing a wide range of signals for TET2, 
OGT and H2B Ser 112 GlcNAc by ChIP-seq. ChIP-qPCR values reflect two inde- 
pendent ChIP assays; each was evaluated in duplicate by qPCR. 
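
The Bowtie settings quoted above can be written out as a command line. The following is a minimal sketch, assuming Bowtie 1 and its documented options `-v`, `-5`, `-3` and `-S`; the index prefix, the file names and the choice of SAM output are illustrative assumptions and are not taken from the methods.

```python
from typing import List


def bowtie_command(index: str, reads_fastq: str, out_sam: str) -> List[str]:
    """Build a Bowtie 1 argument list using the settings quoted in the text."""
    return [
        "bowtie",
        "-v", "3",  # report alignments with at most three mismatches
        "-5", "3",  # trim 3 bases from the 5' end of each read
        "-3", "3",  # trim 3 bases from the 3' end of each read
        "-S",       # write SAM output (an assumption; not stated in the text)
        index,
        reads_fastq,
        out_sam,
    ]


if __name__ == "__main__":
    # Hypothetical index prefix and file names, for illustration only.
    print(" ".join(bowtie_command("mm9", "chip_reads.fastq", "chip_reads.sam")))
```

The list form can be passed directly to `subprocess.run` without shell quoting; adapter removal and quality filtering would have to happen before this step, as the text describes.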

Messenger RNA analysis. To examine gene expression, total RNA was purified 
from mouse ES cells using RNeasy (Qiagen). The RNA was reverse transcribed 
using TaqMan reverse transcription reagents from ABI. qPCR was carried out 
in 20 µl aliquots using SYBR green master mix (NEB) containing 10 nM 6- 
carboxyfluorescein (Sigma) as a reference dye, 50 to 100 ng of cDNA and 2 µM 
primers. The reactions were performed on a Bio-Rad iCycler and quantified using 
the iCycler iQ software. The relative quantities of each mRNA were determined for 
each sample based on the threshold cycle (Ct) value normalized to the corre- 
sponding values for mouse Gapdh. Oligonucleotide sequences of the primers for 
qPCR assays are listed in Supplementary Table 6. All reactions were performed 
in triplicate. 
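
Normalizing a threshold cycle to a reference gene is commonly done with the 2^(-ΔCt) calculation. The sketch below assumes that approach and roughly 100% amplification efficiency; the methods do not state the exact formula used by the iCycler iQ software, and the function names and Ct values here are hypothetical.

```python
from statistics import mean
from typing import Sequence


def relative_quantity(ct_target: Sequence[float], ct_reference: Sequence[float]) -> float:
    """Return 2 ** -(mean target Ct - mean reference Ct) for one sample."""
    if not ct_target or not ct_reference:
        raise ValueError("need at least one Ct value per gene")
    delta_ct = mean(ct_target) - mean(ct_reference)
    return 2.0 ** (-delta_ct)


def fold_change(rq_treated: float, rq_control: float) -> float:
    """Ratio of normalized quantities between two samples."""
    if rq_control <= 0:
        raise ValueError("control quantity must be positive")
    return rq_treated / rq_control


if __name__ == "__main__":
    # Hypothetical triplicate Ct values for a target gene and for Gapdh.
    control = relative_quantity([22.0, 22.1, 21.9], [20.0, 20.0, 20.0])
    knockdown = relative_quantity([23.0, 23.1, 22.9], [20.0, 20.0, 20.0])
    print(fold_change(knockdown, control))
```

Averaging the triplicate Ct values before taking the difference is one of several reasonable conventions; computing a ratio per replicate and then averaging would give slightly different numbers.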

RNA expression data for shCtrl and shTET2 ES cells were generated from polyA 
RNA using GeneChip Mouse Genome 430 arrays (Affymetrix) following 
Affymetrix procedures and analysis. The results are shown in Supplementary 
Table 3. The expression value of each transcript is shown in log2 scale. The data 
analysis was performed using Cluster 3.0 and Java Treeview. 

In vitro 5hmC assay. To examine enzymatic activity of TET2 in vitro, 0.4 µg 
double-stranded DNA oligonucleotides (listed in Supplementary Table 7) that 
contain C, 5mC or 5hmC were incubated with 1 µg TET2CD or 1 µg TET2CD, 
together with 1 µg OGT proteins in the presence of 50 mM HEPES, pH 7.9, 100 mM 
NaCl, 75 µM Fe(NH4)2, 2 mM ascorbate and 1 mM α-ketoglutarate at 37 °C for 
40 min. The DNA substrates were then purified by ethanol precipitation and 
digested with TaqI. After the treatment with calf intestinal alkaline phosphatase, 
the DNA substrates were labelled with [γ-³²P]ATP by T4 polynucleotide kinase, 
purified by ethanol precipitation and digested with 10 µg DNase I and 10 µg phos- 
phodiesterase I. The samples were separated by thin layer chromatography (TLC) 
on PEI-cellulose TLC plates (Merck) using isobutyric acid:NH4OH:H2O at 66:2:20. 
After air drying, the TLC plates were exposed to X-ray film. 

Dot blotting assay. Genomic DNA isolated from 293T cells expressing TET2, 
OGT or both was denatured by 0.2N NaOH and dotted on Hybond-N™ nitro- 
cellulose membrane (Amersham Pharmacia Biotech). After ultraviolet cross-link- 
ing, membranes were blocked overnight with 10% non-fat milk and 1% BSA in 
TBST (150 mM NaCl, 10 mM Tris, pH 8.0, and 0.1% Tween20) at 4 °C followed by 
1 h incubation with either anti-5mC or anti-5hmC antibodies at room tempera- 
ture. Membranes were washed four times with TBST, incubated with horseradish 
peroxidase (HRP)-conjugated goat anti-mouse or anti-rabbit antibodies (GE 
Healthcare), washed with TBST and developed using the ECL+ detection system 
(GE Healthcare). 

Mass spectrometry analysis. For 5ghmC standard, β-glucosyltransferase was 
used to produce 5ghmC following the protocol of Zymo Research using UDP- 
GlcNAc as the donor. The DNA was digested with DNA degradase plus (Zymo 
Research). The samples were subjected to liquid chromatography tandem mass 
spectrometry (LC-MS/MS). Liquid chromatography was performed using a 
2.1 × 50 mm HSS T3 1.8 µm column (Waters), with gradient elution at a flow 
rate of 500 µl min⁻¹ using 0.02% acetic acid in water as mobile phase A and metha- 
nol as mobile phase B. The gradient was (96% A + 4% B) to (70% A + 30% B) in 
4.7 min. The elutes were directed to the mass spectrometer that was running in the 
product scan mode (60-500 m/z) selecting 461 as precursor ion (5ghmC). The 
collision energy used was 30 V. 
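
The gradient described above runs from 4% to 30% mobile phase B over 4.7 min. As a small worked sketch, assuming the ramp is linear (the text gives only the end points and the duration), the composition at any time can be interpolated as follows; the function name is illustrative.

```python
def percent_b(t_min: float, start_b: float = 4.0, end_b: float = 30.0,
              duration_min: float = 4.7) -> float:
    """Percentage of mobile phase B at time t_min, assuming a linear ramp."""
    if duration_min <= 0:
        raise ValueError("duration must be positive")
    # Clamp to the ramp so times before or after it return the end points.
    fraction = min(max(t_min / duration_min, 0.0), 1.0)
    return start_b + fraction * (end_b - start_b)


if __name__ == "__main__":
    for t in (0.0, 2.35, 4.7):
        b = percent_b(t)
        print(f"t = {t:.2f} min: {100 - b:.1f}% A, {b:.1f}% B")
```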

For detecting 5ghmC or glucosylated-5hmC, genomic DNA from 293T cells 
expressing TET2 and OGT was digested as above and subjected to LC-MS/MS 
analysis using the same column and conditions as above. The mass spectrometer 
was running in multiple reaction monitoring mode, monitoring the transition of 
m/z 242.0 to 126.0 (5mC), m/z 258.0 to 142.0 (5hmC), 461.0 to 345.0 (5ghmC) and 
m/z 420.0 to 304.0 (glucosylated-5hmC), with the collision energy as 15 V. 

CLINICAL RESEARCH 


Conducting a trial 


Beginners hoping to initiate and complete a clinical trial 
must understand the complexities of the process. 


BY KELLY RAE CHI 


Before Edward Chu entered his fourth 
year at Icahn School of Medicine at 
Mount Sinai in New York, he took a 
12-month sabbatical to launch his first clinical 
trial. He wanted to evaluate whether patients 
using an anti-blood-clotting drug, usually to 
prevent heart attacks, should stop before gen- 
eral surgery. He thought that if he dedicated 
himself to the project for 12 months, he could 
wrap up any loose ends before he graduated 
in 2013. That was optimistic, to say the least. 
Chu admits that his goal — to recruit 
200 patients, half of whom would continue tak- 
ing the drug before surgery while the other half 
discontinued it — was ambitious for a year-long 
project. His adviser offered guidance through- 
out, but the day-to-day logistics were up to Chu. 
Even with an encouraging adviser, financial 
support from a Doris Duke Clinical Research 
Fellowship and access to hundreds of potential 
participants, planning the trial was far more 
time-consuming than Chu had expected. All 
investigators need help from specialists and 
other partners to embark on a clinical trial, 
but especially first-timers. Although leading 
a trial often falls to senior researchers, junior 
investigators training as physicians or under- 
taking PhDs can have a major role — especially 
if they are hoping to make clinical studies a big 
part of their career. Chu needed assistance at 
almost every step — from his adviser, support 
staff such as nurses and research coordinators, 
biostatisticians, members of regulatory review 
boards, surgeons and cardiologists. In the end, 
he had to scale the work back into a pilot study, 
but the experience taught him a great deal 
about what it takes to make a trial happen. 


FIRST STEPS 

Initiating and sponsoring a clinical trial to 
generate safety and efficacy data for a drug, 
diagnostic test, device or therapy protocol is 
an all-consuming team project. The planning 
stages, which involve transforming an idea into 
a detailed protocol, assessing and procuring 
resources and getting approval to do the study, 
are almost always more complicated and time 
consuming than a junior investigator expects. 
They involve multiple reviews from regulatory 
agencies, and a sometimes gruelling effort to 
motivate busy clinicians — who often have 
little stake in the outcome — and their staff, 
key collaborators who help to provide access 
to a pool of patients. All these moving parts 
require careful orchestration. For a new 
investigator, each step of a clinical trial rep- 
resents an important learning experience, says 
Karen Zier, associate dean for medical-student 
research at Mount Sinai, who helped to pair 
Chu with his adviser. These are not lessons that 
trainees can learn solely from a textbook. 

A clinical trial starts with a scientific ques- 
tion. But before designing the trial, researchers 
must have an idea of whether it is feasible: they 
must transform the idea into a testable hypoth- 
esis; consider the target patient population; 
and determine the appropriate time frame in 
which to get results, says Edward Kim, chair 
of solid-tumour oncology and investigational 
therapeutics at Levine Cancer Institute in 
Charlotte, North Carolina. Feasibility analysis 
is often handled by clinical-research coordi- 
nators or other experts at the investigator's 
home institution, who can help the investiga- 
tor to decide, from a financial and operational 
standpoint, whether the study goal is realistic. 
“There are many cases when you can have a 
great scientific idea, but it’s not practical [to do 
a clinical trial],” says Kim. Failure is likely, he 
says, if a trial has too many rigid requirements 
for patient inclusion, or is too complex to 
attract or treat enough patients. Junior investi- 
gators should talk to mentors and experienced 
clinical researchers in their field to help them 
to decide whether an idea is realistic. 

Just calculating the appropriate number of 
participants can be a challenge, says Norma 
Terrin, director of the Research Design 
Center and the Biostatistics Research Center 
at Tufts Medical Center in Boston, Massa- 
chusetts. A biostatistician or epidemiologist 
can help. Other concerns include whether to 
do a blinded study (in which the participants, 
researchers or both are unaware of which treat- 
ment is being administered), a randomized one 
(in which participants are assigned to a treat- 
ment group by chance) or both; whether there 
are any potential sources of bias; how research- 
ers will analyse the data; why data might be 
missing (reasons include patients dropping out 
of the study or not complying with protocols); 
and how researchers will deal with missing data 
points. With due consideration, the study can 
be designed to minimize data loss, says Terrin. 
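
To give a sense of the arithmetic behind the number of participants, here is a minimal sketch of one textbook calculation: the normal-approximation sample size for comparing two proportions in a two-sided test. It is an illustration under stated assumptions, not the method used by Terrin or in Chu's trial, and the event rates in the usage example are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def participants_per_group(p_control: float, p_treatment: float,
                           alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate participants needed in each arm to detect a difference
    between two event rates (two-sided test, equal group sizes)."""
    if not (0 < p_control < 1 and 0 < p_treatment < 1):
        raise ValueError("event rates must lie strictly between 0 and 1")
    if p_control == p_treatment:
        raise ValueError("event rates must differ")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value for the test
    z_beta = NormalDist().inv_cdf(power)           # quantile for the desired power
    variance = p_control * (1 - p_control) + p_treatment * (1 - p_treatment)
    n = (z_alpha + z_beta) ** 2 * variance / (p_control - p_treatment) ** 2
    return ceil(n)


if __name__ == "__main__":
    # Hypothetical event rates of 10% and 5% in the two arms.
    print(participants_per_group(0.10, 0.05))
```

Because the required number grows with the inverse square of the difference in rates, halving the expected effect roughly quadruples the number of participants. The calculation ignores dropouts and missing data, which a biostatistician would account for separately.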


FINANCIAL STRUGGLE 

Trials can be expensive: they range from tens of 
thousands of dollars for small studies to several 
hundred million for large, multicentre trials, 
according to a 2010 report from the Institute of 
Medicine in Washington DC (Forum on Drug 
Discovery, Development, and Translation. 
Transforming Clinical Research in the United 
States; National Academies Press, 2010). Costs 
vary from nation to nation owing in part to 
differing regulatory standards and patient- 
recruitment practices. Even within a single 
country, institutions bill in different ways for 
insurance, lab tests and supplies, staffing and 
other study costs. This combination of factors 
makes financing a major hurdle. 


Early-career investigators might be able 
to get their own institutions or foundations 
to provide small start-up grants — typically 
ranging from US$20,000 to $50,000 — to 
fund collection of preliminary data. At the MD 
Anderson Cancer Center in Houston, Texas, 
for example, the Duncan Family Institute Seed 
Funding Research Program awards $50,000 
per year for two years to support preliminary 
research on cancer prevention and risk, with 
the goal of improving the success of proposals 
submitted for larger grants. And some of the 
60 institutions supported by US National Insti- 
tutes of Health Clinical and Translational Sci- 
ence Awards (CTSAs) provide money for pilot 
studies. For example, the Institute for Clinical 
and Translational Science at the University of 
Iowa in Iowa City offers up to $50,000 for a 
preliminary study lasting no longer than one 
year. There is a wide range of CTSA start-up 
grants, from a small amount to screen an assay 
to larger amounts to get projects off the ground, 
says Bobbi Gardner, a spokeswoman for the 
National Center for Advancing Translational 
Sciences in Bethesda, Maryland, which 
administers the CTSAs. 

The figures are similar in other countries. 
Cancer Research UK, for example, offers 
feasibility-study grants of around £25,000 
(US$40,000) a year for two years. In rare cases, 
it might provide as much as £80,000. 

Many institutions, including those with 
CTSAs, fund shared clinical-trial resources 
in the form of, for example, a biostatistics 
department, a study facilitator or coordinator, 
or specific programmes. Foundations run by 
patient advocates — who are often impatient 
with the slow pace of clinical research — not 
only fund clinical trials but also support them 
by, for example, helping to recruit participants. 
If a trainee’s institution has no track record in 
clinical research, there is the possibility of col- 
laborating with a researcher at one that does. 

Then there are industry partnerships: com- 
panies will often fund clinical trials of their 
own drugs or devices. Junior investigators 
should keep in mind, however, that involving 
industry can mean that it takes longer to sort 
out reimbursement, data sharing, intellectual- 
property issues and any extra reviews required 
legally or by the company itself. 

Before putting together a team or enroll- 
ing patients, clinical-trial investigators must 
secure regulatory approval for their study plan. 
The specifics of the approval process vary by 
country and even within countries, but the 
basic goal is the same: to protect the safety 
of research participants. Protocols should be 
written according to the local version of Good 
Clinical Practice, an international standard 
for designing and carrying out clinical tri- 
als, provided by the International Conference on 
Harmonisation in Geneva, Switzerland. 

In some countries, including the United 
States, Australia, Japan, New Zealand and South 
Africa, the protocol is usually examined by an 
institutional review board (IRB) affiliated with 
the investigator's hospital or medical centre. 
IRBs uphold federal, state and local regulations 
and university policies. Each board interprets 
regulations differently, so multisite studies 
involving more than one board are often com- 
plicated. As a general rule, researchers should 
make their protocols extremely easy to under- 
stand — members of review-board panels will 
not necessarily be familiar with the subject area. 

Researchers must be prepared to justify every 
aspect of a protocol; for example, if it calls for 
four biopsies, the investigator must be able to 
explain the reason for each. If the IRB thinks 
that the risks of any of the steps outweigh the 
potential benefits, it will ask for changes. This 
back-and-forth can take months, and it doesn't 
end there: IRBs often ask for update reports at 
regular intervals during the trial. 

Researchers in the United States may also 
need to allow an extra one or two months to 
file an application for an investigational new 
drug or investigational new device to the Food 
and Drug Administration (FDA), which will 
do a safety review. This step doesn’t just apply 
to new pharmaceutical compounds: it may also 
be necessary if, for example, an investigator is 
using a nutritional supplement to treat a dis- 
ease; in that case, the supplement is technically 
a new drug and requires an FDA review, notes 
Harvey Arbit, a regulatory-affairs consultant 
who lectures at the University of Minnesota 
in Minneapolis. Complicating matters, the 
wording of the protocol may influence whether 
filing is necessary: if the filer states that the 
supplement is “normalizing physiological 
functions” rather than “treating”, the decision 
becomes more difficult. There is no foolproof 
way to decide, says Arbit — when in doubt, it 
is safer to file. Even if the investigator does this 
but ends up receiving a letter from the FDA 
stating that filing was unnecessary, it is good 
to have the documentation. 


THE RIGHT PEOPLE 

Chu’s goal of enrolling at least 200 patients in 
his study meant that he had to work with mul- 
tiple clinical specialists. But he found it daunt- 
ing, especially as a newcomer, to approach 
cardiologists and surgeons individually to 
sell his study. Specialists tend to be sceptical 
of studies and protective of their patients — 
and Chu’s study had associated risks. But after 


approaching many physicians, he was eventu- 
ally able to get about a dozen on his team. 

Clinicians help to identify patients eligi- 
ble for the study. Crucially, they can boost 
recruitment by mentioning that they are 
excited about the project, says Jeff Burns, 
director of the Clinical and Translational Sci- 
ence Unit at the University of Kansas Medi- 
cal Center in Kansas City. But the next steps 
of enrolment — pre-screening the patients 
by reviewing medical histories, approach- 
ing them to gauge interest, discussing pro- 
tocols and informed consent, and setting 
up in-person screening visits — require a 
huge amount of work. A common mistake 
is for novice investigators to rely too much 
on the clinician's office to sign up patients. 
Clinicians and their staff focus on treating 
patients, not research; as a result, a study can 
fall by the wayside “unless someone is there 
to push the protocol and research agenda”, 
says Chu. Enrolment should be monitored 
at each step by a team member fully invested 
in the study — perhaps the principal inves- 
tigator, a trainee physician doing a medical 
residency or a medical student. Researchers 
with sufficient funds can hire a dedicated 
recruitment coordinator to screen prospec- 
tive participants for any reasons they should 
be disqualified from the study, as well as to 
explain the study and its risks and send out 
consent forms. 


RECRUITMENT DRIVE 

It took Chu six months to design his study, get 
IRB approval and assemble a research team. 
Only then could he begin recruiting patients 
and collecting data. A few months into data 
collection, he ran a preliminary analysis that 
revealed that he would in fact need several 
hundred participants in each study group to 
provide a significant result. By the end of his 
fellowship year, he had enrolled only 50. 

One problem was that a potential par- 
ticipant’s cardiologist and surgeon often 
disagreed about whether they should dis- 
continue or continue the anti-platelet medi- 
cation, so Chu could not invite that person 
to participate in the study. Furthermore, 
potential participants’ medical histories were 
often more complex than anticipated, mak- 
ing it difficult to determine who was suitable 
for the trial. 

Enrolment challenges are not unusual. A 
2011 study found that nearly one-third of 
clinical studies terminated at Oregon Health 
and Science University (OHSU) in Portland 
between 2006 and 2009 were under-enrolled 
for various reasons; such terminations cost 
OHSU at least $1 million in 2009 alone 
(D. R. Kitterman et al. Acad. Med. 86, 1360- 
1366; 2011). Low recruitment is a big prob- 
lem in the United States and elsewhere, says 
William Balke, a programme director of clin- 
ical-research services at the University of Cali- 
fornia, San Francisco (UCSF). “If we don’t do 


a better job [at recruitment], we're wasting the 
public's money and we're not advancing sci- 
ence,” he adds. One reason for the problem is 
a lack of thorough feasibility analysis to deter- 
mine, for example, whether there are enough 
patients to do the desired study. 

Some institutions run formal recruit- 
ment programmes; for example, UCSF has 
a Recruitment and Implementation Core, 
which serves the university and affiliated 
investigators. Investigators can also turn to 
online participant registries and tools such 
as ResearchMatch.org — a free online service 
that connects volunteers with researchers — 
or even use less conventional communica- 
tion routes, such as social media. “We have 
been extraordinarily, surprisingly successful 
recruiting patients through Craigslist and 
keeping in touch with them through e-mail, 
text messages and Twitter,” says Balke. 

But researchers need to exercise caution 
when reaching out to patients through social 
media. Rahlyn Gossen, founder of Rebar 
Interactive, a digital-marketing company 
based in New Orleans, Louisiana, which 
recruits and retains study subjects, says 
that researchers’ messages to potential par- 
ticipants need to be approved by the local 
IRB, regardless of whether they are online. 
Regulations on patient recruitment were not 
written with social media in mind, so Gossen 
recommends that investigators check with the 
board before they submit their official protocol 
for approval. “Some boards might allow you 
some variation, but I’ve also run into some that 
won't let you post on social media at all,” she says. 

At the end of Chu’s clinical-trial year, he and his 
supervisor decided to turn the trial into 
a feasibility study, 
to be completed while Chu finishes medical 
school. Still, he feels that he accomplished a 
lot in one year — and he learned a lot about 
the complexities of the clinical-trial process. 
Now in his fourth year of medical school, 
Chu is still enrolling patients and collecting 
data, and is looking forward to launching a 
new clinical trial in the future. “It’s a learning 
experience,” he says. With the understanding 
he has gained, Chu expects his next trial to be 
“easier to manage”. 


Kelly Rae Chi is a freelance writer based in 
Cary, North Carolina. 

FUNDING 
NSF grant changes 


The US National Science Foundation 
(NSF) has changed some of its grant- 
submission requirements, effective from 
14 January. The project-summary section 
of the submission now asks applicants to 
use separate text boxes for their proposal 
overview, their description of the project’s 
intellectual merit and their explanation of its 
broader impacts. Submitting these sections 
as one document will cause the application 
to be rejected. NSF spokeswoman Maria 
Zacharias says that reviewers were spending 
too long teasing out the merits and impacts 
of proposals. Applicants may also now list 
research products such as patents, data sets 
or software in addition to publications — 

a boon for junior investigators, says 
Zacharias. The changes stem from a review 
by the NSF’s oversight board, and a federal 
directive that the agency recognize the 
broader impact of research it supports. 


TENURE 
Respect for librarians 


Many more US university librarians 

should be tenured faculty members, argue 
groups representing universities and 
colleges, academic libraries and professors. 
A statement released on 10 January and 
spearheaded by the American Association 
of University Professors in Washington DC 
notes that librarians support research needs, 
contribute to intellectual and academic 
freedom, perform outreach and should 
have a right to contribute to university 
policy. “There's a lack of recognition 
as to what librarians actually do,” says 
Deanna Wood, an associate professor and 
reference librarian at the University of New 
Hampshire in Durham and part of the Joint 
Committee on College Library Problems, 
which drafted the statement. 


SALARIES 

Academic pay lagging 
Early-career scientists with full-time 

jobs in US academia earn an average of 
US$58,000 annually, less than those in 
industry, non-profit or government, says 
a report from the US National Science 
Foundation (NSF). Industry pays the 
most: $100,000 per year in early career, 
and $130,000 for those 10 years past their 
PhDs. Academics who got a doctorate a 
decade ago or more earn $93,000 a year 
on average. Daniel Foley, a statistician 
for the NSF in Arlington, Virginia, says 
that the data underscore the need to train 
researchers for work outside academia. 

THE SHORTLIST 


BY ADAM KUCHARSKI 


“Top work everyone. Seriously, this 
is outstanding.” Hugo Vriers leafed 
through the stack of papers. “Two 
dogs and a monkey, extraordinary.” 

“Chimp,” Professor Reed muttered. 

Vriers squinted at the page. “Quite. And 
how is he doing, Janet?” 

Dr Cranmer leant forward in her chair. 
“He's pretty sprightly, all things considered.” 

Vriers turned to Professor Garcia, who 
was busy jabbing his phone with an elderly 
finger. “Pablo, did everything stay within the 
limits this time?” 

Garcia nodded. “All was smooth. No 
repeats of the second test.” 

“What happened in the second test?” Reed 
asked. 

“Knocked out the power in half of Lon- 
don,” Vriers said. 

“Which half?” 

“South.” 

Reed chuckled. “Ah well, these things hap- 
pen.” 

“Tell that to the Mayor.” Vriers pushed the 
papers to one side. “Now, there's a few things 
we need to discuss. I’ve had a chat with the 
board, and we've decided to draw up a short- 
list of objectives.” 

Garcia shook his phone as if it were a 
sandy shoe. “Objectives?” 

“Yes. We need a proper plan for this tech- 
nology. It’s all very well sending a beagle to 
the Renaissance or a chimp to the jazz age, 
but human time travel ... well, the possibili- 
ties are endless. Literally.” 

“What's on your shortlist?” Reed asked. 

Vriers flipped open a notebook. “Number 
one: ‘Get rid of post tyrants.’” He paused. 
“Past. I think that should say ‘past’.” 

“Which ones?” asked Cranmer. 

“We haven't decided yet. All of them per- 
haps? Top ten?” He tapped his pen against 
the pad. “Hitler seems like an obvious place 
to start.” 

Garcia frowned. “Time is a delicate tapes- 
try. We should not naively pull at her threads.” 

Vriers looked at Cranmer, who shrugged, 
then turned back to Garcia. “I have a feeling 
you’re not so keen, Pablo?” 

Garcia sighed, shaking his head. “You talk 
of changing history as if it were a grocery list. 
We cannot just decide to—” 

“Okay,” Vriers said, “how about I just put 
Hitler for now? With maybe a little question 
mark in brackets. We can always add more 
later. Right, number two: mysteries. There's a 
few things we ought to get to the bottom of.” 

Time management. 

“Can we find out what happened to Kit 
Marlowe?” Cranmer asked. 

Vriers’ pen hovered over the page. “Shake- 
speare’s friend? See, I’m not sure the board 
will go for that. How about Roswell, or JFK?” 

“Or the Amber Room,” Reed said. 

“What's that?” Cranmer asked. 

“Big room. Made 
of amber. Someone 
pinched it from the Rus- 
sians in the ’40s.” 

Cranmer raised 
her eyebrows. “Worth 
much?” 

“Probably.” 

“Actually, is dosh on 
your list, Hugo?” she 
asked. 

“Dosh?” Vriers said. 

“With our machine, 
we could get the lottery 
numbers.” 

Vriers winced. “Isn't 
that a little ...” 

“A little?” 

“Unethical? I imagine the committee 
would get annoyed.” 

“What committee?” 

“I don't know. I just assumed you lot had a 
committee for that sort of thing.” 

Cranmer removed her glasses. “Even if we 
do, there isn’t much in the kitty anymore.” 

Reed nodded. “And school fees don't pay 
themselves.” 

“I see,’ Vriers said, scribbling on his pad. 
“Right, how about this? ‘Win lottery... but 
only a bit.” 

“Need five years’ boarding at least,” Reed
said. 

“Very well.” Vriers turned the page. “Next
we have ‘Inspiration’. The directors were
hoping we could get some new ideas from 
somewhere. Or somewhen, rather. Any sug- 
gestions?” 

“I'd love to meet Newton,” Cranmer said.
“That would certainly inspire me.” 

“And Monroe,” Reed added.

“Can we try and focus on —” 

“I think she was best in 1955,” Reed said, 
leaning back in his chair. “The Seven Year 
Itch, what a—” 

“The future!” Vriers groaned. “Please, 
concentrate on the future. What can we 
bring back? What technology do we need?” 
“Do we not risk a paradox?” Garcia said.
“I am concerned that —”

“It’s fine,” Vriers said. “Our logic team are
on it.”

“Logic team?” Reed asked. 

“The laddies in the basement,” Cranmer
said, pointing downwards, “the ones with toy 
DeLoreans on their desks.” 

Reed laughed. “I thought they were some- 
one’s kids.” 

“It is not so simple,” Garcia said. “No one
can truly see how —” 

“They know what 
they're doing,” Vriers 
said. “Trust me, we're
fine on the paradox 
front.” 

“As long as nobody 
gets someone pregnant,” 
Cranmer said. 

“What?” 

“You know, like in 
The Terminator. Time 
babies.”

“Look, nobody is 
having any, um, time...” 

“Babies.” 

“... time babies. Now,
let’s focus on busi- 
ness. There are nine more objectives to get 
through.” Vriers glanced at his phone, sat 
atop one of the piles of papers. “It’s almost 
three o'clock, and I’ve got a meeting at four.
So let's get on with it.”

As he picked up his pen, the phone started 
to ring. He answered it and a draught sent 
the documents billowing across the room. 

“Are you serious?” Vriers asked, sur- 
rounded by the dancing sheets. He waved 
them away, the colour draining from his face. 
“When did this happen? Who left it turned 
on? Okay, I'll be right down.”

“What's the problem?” Cranmer said. 

Vriers stood up. “The chimp’s stolen the 
machine.” 

“Sorry, what?” 

“The chimp. He’s vanished with the 
machine.” 

The room fell silent. 

“Come on,” Vriers said, slamming his
palms against the table. “Don't just sit there! 
We need to get it back! Think, what are we 
going to do?” 

Garcia smiled. “Put it on your list?”


Adam Kucharski is based in London, 
where he researches the cocktail of influences 
behind disease outbreaks. 